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Field of the invention 

The present invention relates generally to means and methods of diagnosing and 
treating the phenotypic spectrum as well as the overlapping clinical characteristics 
with several forms of inherited abnormal expression and/or function of the 
cytochrome P-450 (CYP)3A4 and CYP3A7 genes. In particular, the present invention 
relates to polynucleotides of molecular variant CYP3A4 and CYP3A7 genes which, 
for example, are associated with abnormal drug response or individual predisposition 
to several common cancers caused by environmental carcinogens, and to vectors 
comprising such polynucleotides. Furthermore, the present invention relates to host 
cells comprising such polynucleotides or vectors and their use for the production of 
variant CYP3A4 and CYP3A7 proteins. In addition, the present invention relates to 
variant CYP3A4 and CYP3^7 proteins and antibodies specifically recognizing such 
proteins. The present invention also concerns transgenic non-human animals 
comprising the above-described polynucleotide or vectors. Moreover, the present 
invention relates to methods for identifying and obtaining drug candidates and 
inhibitors for therapy of disorders related to the malfunction of the CYP3A4 and 
CYP3A7 genes as well as to methods of diagnosing the status of such disorders. The 
present invention furthermore provides pharmaceutical and diagnostic compositions 
comprising the above-described polynucleotides, vectors, proteins, antibodies, and 
drugs and inhibitors obtainable by the above-described method. Said compositions 
are particularly useful for diagnosing and treating various diseases with drugs that 
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are substrates, inhibitors or modulators of the CYP3A4 and CYP3A7 genes or their 
product. 

Several documents are cited throughout the text of this specification. Each of the 
documents cited herein (including any manufacturer's specifications, instructions, 
etc.) are hereby incorporated herein by reference; however, there is no admission 
that any document cited is indeed prior art as to the present invention. 

Background of the invention 

IVIembers of the cytochrome P-450 (CYP) family of hemoproteins metabolise a wide 
variety of endogenous substrates such as steroid hormones, and of xenobiotics 
including carcinogens, toxins and drugs(r,2). Of the human CYP proteins, those of 
the CYP3A subfamily are of a major importance, since collectively, they are by far the 
most abundant of all the human CYP isoforms. Moreover, their substrate specificity is 
extremely broad; accordingly, many structurally diverse compounds are, exclusively 
or to some extent, substrates for CYP3A proteins. Based on the data available it is 
generally assumed that all CYP3A isoforms have similar substrate spectra; however, 
limited studies indicate the possibility of differences (3). All CYP3A isoforms are 
localized in organs of particular importance to drug disposition (gastrointestinal tract, 
kidney and liver). 

At least three functional CYP3A proteins exist in humans. The CYP3A4 
monooxygenase is the predominant cytochrome P450 in human liver and small 
bowel. The protein displays a broad substrate specificity and it metabolises more 
than 60% of ail drugs that are currently in use, including contraceptive steroids, 
antidepressants, benzodiazepines, immunosuppressive agents, imidazole 
antimicotics, and macrolide antibiotics (4,5). In addition, CYP3A4 plays a major role 
in the protection from environmental toxins. For example, the protein metabolizes 
aflatoxin B1, which has been implicated in the etiology of liver cancer, which is a 
major cause of premature death in many areas of Africa and Asia. Aflatoxin B1 is a 
mycotoxin produced by species of Aspergillus, and human exposure results 
principally from the ingestion of stored foodstuffs contaminated with the mold. 
Carcinogenicity is associated with its conversion to 8,9-oxide by the hepatic 
cytochrome P450-dependent monooxygenase system. Forrester et al. (5) found that 
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the rates of metabolic activation of aflatoxin B1 were highly correlated with the level 
of proteins of the CYP3A gene family In the microsomes. Furthermore, Paolini et al. 
(7) found significant increases in CYP3A in the lungs of rats treated with high doses 
of beta-carotene. Consequently, it was proposed that correspondingly high levels of 
CYP3A4 in humans would predispose an individual to cancer risk from the 
bioactivated tobacco-smoke procarcinogens, thus explaining the cocarcinogenic 
effect of beta-carotene in smokers. All this implies that individual variation in the 
CYP3A4 activity could influence the efficacy of a variety of drug therapies as well as 
the individual predisposition to several major cancers caused by environmental 
carcinogens. 

A considerable variation in the CYP3A4 content and catalytic activity has been, 
indeed, described in the general population. For example, the metabolic clea'rance of 
the gene substrates exhibits a unimodal distribution with up to 20-fold interindivldual 
variability. The activities of the CYP3A4 protein in liver biopsies vary up to 30-fold (8). 
Furthermore, many common drugs alter the expression levels of the gene (induction 
or repression) and the extent of this phenomenon is individually variable. The 
inducers of CYP3A4 expression include commonly used drugs such as the 
glucocorticoid dexamethasone, the antibiotic rifampicin, and the antimycotic 
clotrimazole. The inducibility of CYP3A4 expression, combined with the diverse 
renge of substrates, creates a potential for potentially harmful drug interactions 
involving this isozyme in patients undergoing therapies with multiple drugs. 

CYP3A3 is a very closely related isoform to CYP3A4 (>more than 98% cDNA 
sequence similarity), but it is not known whether this reflects a separate gene product 
or an allelic variant. By contrast, CYP3A5 is a gene distinct from CYP3A4 and it is 
expressed polymorphically both in the adult and fetal liver and in the kidney and 
intestine. In adult Caucasians, the mRNA and the protein were detected in the liver of 
10 to 30% of samples, while the protein was detected in the kidney and intestine of 
70% of subjects (Ref. (9) and references therein). A point mutation described in the 
CYP3A5 gene which possibly results in the synthesis of an unstable protein, may 
account for the polymorphic expression of this enzyme (9). CYP3A7 is the third 
functional CYP3A isoform. CYP3A7 was originally isolated from a fetal liver but it was 
subsequently found in 54% of adult livers (10). 
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Tests to estimate the inducibility and the activity of CYP3A Isozymes in an individual 
patient would be of obvious relevance for the optimization of therapies with drugs 
which are their substrates, and for the prevention of the associated side effects. 
Direct estimates of the activities of CYP3A isozymes In liver biopsies are possible but 
impracticable for both ethical and cost reasons. The indirect in vivo tests of CYP3A4 
activity such as the erythromycin breath test or the 6-B-hydroxycortisol test pose 
ethical problems such as the Invasive admininstration of undesirable probe 
substances and they are obviously unsuited for routine testing. In addition, the lack of 
correlation between these tests questions their Informative value regarding the 
CYP3A4 activity (7 7). 

A major portion (83%) of the interindividual CYP3A4 variability has been attributed to 
genetic facors {12). The establishment of a genetic test for the activity of'CYP3A4 
and of the other CYP3A Isozymes should be possible, assuming the prior 
identification of those factors. Genetic variance affecting the activity and the 
expression of CYP3A isozymes could be localized In the genes itself, or In one or 
more of their regulators. A comparison of the three originally published sequences of 
the best characterized CYP3A gene, CYP3A4, suggested the existence of 
polymorphisms affecting the amino acid sequence of the CYP3A4 protein (13). 
Unfortunately, this obsen/ation has not been, to our knowledge, confirmed In the 
general population. More recently, a polymorphism (CYP3A4-W) has been described 
in the nifediplne-specific response element of the CYP3A4 promoter (74). Its 
presence associates with a more advanced prostate tumor stage ( 74). Felix et al. 
(75) examined this polymorphism in 99 de novo and 30 treatment-related leukemlas. 
In all treatment-related cases, there was prior exposure to one or more anticancer 
drugs metabolized by CYP3A, such as epipodophyllotoxins. These data suggest that 
individuals with the CYP3A4-W polymorphism may be at increased risk for treatment- 
related leukemia and that epipodophyllotoxin metabolism by CYP3A4 may contribute 
to the secondary cancer risk. At present it Is unclear if the polymorphism influences 
the expressivity or inductibility of the CYP3A4 protein. A first published analysis 
suggests that it has no effect on the basal expression level of CYP3A4 (8). A point 
mutation was described in the CYP3A5 (9), whereas no mutations have been 
reported in CYP3A7. 
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Experiments with amino acid exchanges artificially introduced into the CYP3A4 gene 
indicate that the function of the family members may be quite sensitive to amino acid 
exchanges (16-21). Besides amino acid exchanges, silent polymorphisms and those 
localized in untranslated or intronic sequences also could influence the expression 
level of these genes. Alternatively, such polymorphisms could serve as markers for 
nearby, unidentified polymorphisms. This effect is known as linkage, i.e. defined 
polymorphisms serve as markers for phenotypes that they are not causative for. 

A major breakthrough in the understanding of the CYP3A expression and inducibility 
took place in 1998 when three research groups independently showed that the 
expression of CYP3A4 is regulated by a member of the orphan nuclear receptor 
family termed PXR (pregnane X receptor), or PAR (22-24). Upon treatment with 
inducers of CYP3A4, PXR binds to the rifampicin/dexamethasone response element 
in the CYP3A4 promoter as a heterodimer with the 9-cis retinoic acid receptor (RXR). 
Northern blot analysis detected most abundant expression of hPXR in liver, colon, 
and small intestine, i.e. in the major organs expressing CYP3A4. The available 
evidence suggests that human PXR serves as a key transcriptional regulator of the 
CYP3A4 gene. A recent report describes the induction of CYP3A7 mediated by PXR 
suggesting that , all members of the fanriily may be regulated by this common 
transcriptional activator (25). 

It is clear that naturally occurring mutations, if they exist can have effects on drug 
metabolization and efficacy of therapies with drugs, in particular in cancer treatment. 
It is unknown, however, how many of such variations exist, and with what frequency 
and at what positions in the human CYP3A4 and CYP3A7 genes. 
Accordingly, means and methods for diagnosing and treating a variety of forms of 
individual drug intolerability and Inefficacy of drug therapy which result from CYP3A4 
and/or CYP3A7 gene polymorphisms that interfere e.g., with chemotherapeutic 
treatment of diseases, in particular cancer, were hitherto not available but are 
nevertheless highly desirable. 



Thus, the technical problem of the present invention is to comply with the needs 
described above. 



6 

The solution to this technical problem is achieved by providing the embodiments 
characterized in the claims. 

Summary of the Invention 

The present invention is based on the finding of novel, so far unknown variations in 
the nucleotide sequences of the CYP3A4 and CYP3A7 genes and the population 
distribution of these alleles. Based upon the knowledge of these novel sequences 
diagnostic tests and reagents for such tests were designed for the specific detection 
and genotyplng of CYP3A4 and CYP3A7 alleles in humans, including homozygous 
as well as heterozygous, frequent as well as rare alleles of the CYP3A4 and CYP3A7 
genes. The determination of the CYP3A4 and/or CYP3A7 gene allele status of 
humans with such tests is useful for the optimization of therapies with the numerous 
substrates of CYP3A4 and CYP3A7. It may also be useful in the determination of the 
individual predisposition to several common cancers caused by environmental 
carcinogens. 

In a first embodiment, the invention provides polynucleotides of molecular variant 
CYP3A4 and CYP3A7 genes and embodiments related thereto such as vectors, host 
cells, variant CYP3A4 and CYP3A7 proteins and methods for producing the same. 
In yet another embodiment, the invention provides methods for identifying and 
obtaining drug candidates and inhibitors of CYP3A4 and CYP3A7 for therapy of 
disorders related to acquired drug hypo- or hypersensitivity as well as methods of 
diagnosing the status of such disorders. 

In a further embodiment, the invention provides pharmaceutical and diagnostic 
compositions comprising the above-described polynucleotides, vectors containing the 
same, proteins, antibodies thereto, and drugs and inhibitors obtainable by the above- 
described method. 

The pharmaceutical and diagnostic compositions, methods and uses of the invention 
are useful for the diagnosis and treatment of cancer and other diseases the therapy 
of which is dependent on drug treatment and tolerance. The novel variant forms of 
CYP3A4 and CYP3A7 genes according to the invention provide the potential for the 
development of a pharmacodynamic profile of drugs for a given patient. 



Description of the invention 
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The finding and characterization of variations in the CYP3A4 and CYP3A7 genes, 
and diagnostic tests for the discrimination of different CYP3A4 and CYP3A7 alleles in 
human individuals provide a very potent tool for improving the therapy of diseases 
with drugs that are targets of the CYP3A4 or CYP3A7 gene product, and whose 
metabolization is therefore dependent on CYP3A4 or CYP3A7. The diagnosis of the 
individual allelic CYP3A4 and CYP3A7 status permits a more focused therapy, e.g., 
by opening the possibility to apply individual dose regimens of drugs. It may also be 
useful as prognostic tool for therapy outcome. Furthermore, diagnostic tests .to 
genotype CYP3A4 and CYP3A7, and novel CYP3A4 and CYP3A7 variants, will not 
only improve therapy with established drugs and help to correlate genotypes with 
drug activity or side effects. These tests and sequences also provide reagents for the 
development of novel inhibitors that specifically modulate the activity of the individual 
types of CYP3A4 and CYP3A7. Expression in yeast, for example, of three allelic 
cDNAs encoding human liver CYP3A4 and methods for testing the binding properties 
and catalytic activities of their gene products have been described in (13). 

Thus, the present invention provides a novel way to exploit molecular biology and 
pharmalogical research for drug therapy while bypassing their potential detrimental 
effects which are due to expression of variant CYP3A4 and CYP3A7 genes. 

Accordingly, the invention relates to a polynucleotide selected from the group 
consisting of: 

(a) a polynucleotide having the nucleic acid sequence of SEQ ID NO: 54, 
55, 58, 59, 62, 63, 66, 67, 70, 71, 74, 75, 78, 79, 82. 83, 86, 87, 90, 91, 
94, 95, 98, 99, 102, 103, 106, 107. 110, 111, 118. 119, 122, 123, 126, 
127, 128, 134, 138, 144, 146, 148. 150. 151. 152, 153, 154, 156, 157. 
159. 161. 162, 163, 164 or 171; 

(b) a polynucleotide encoding a polypeptide having the amino acid 
sequence of any one of SEQ ID NO: 129, 135, 139, 145, 147, 155, 158, 
160 or 172; 
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(c) a polynucleotide encoding a CYP3A4 or CYP3A7 polypeptide, wherein 
said polynucleotide is having at a position corresponding to any one of 
position 6004, 13908, 14292, 14304, 14323, 14329. 14357, 15753, 
20230, 21867. 21868. 21896. 22026. 22041, 23081. 23172, 25925 or 
25958 of the CYP3A4 gene (Accession No: AF280107, whereby the 
nucleotide A of the first ATG encoding the CYP3A4 protein has been 
taken as position 1 ) or at a position corresponding to position 1 229 of 
the CYP3A7 (Accession No: gi4503232) a nucleotide exchange, a 
nucleotide deletion, an additional nucleotide or an additional nucleotide 
and a nucleotide exchange, wherein said nucleotide deletion at a 
position corresponding to position 23172 Is not resulting in an M to T 
amino acid substitution or is not a T to C nucleotide exchange; 

(d) a polynucleotide encoding an CYP3A4 or CYP3A7 polypeptide" wherein 
said polynucleotide is having at a position corresponding to any one of 
position 6004, 13908, 14292, 20230 or 21868 of the CYP3A4 gene 
(Accession No: AF280107, whereby the nucleotide A of the first ATG 
encoding the CYP3A4 protein has been taken as position 1) an A, at a 
position corresponding to any one of position 14323, 14329, 21867, 
21896, 22026, 22041, 23081 or 25925 of the CYP3A4 gene (Accession 
No: AF280107, whereby the nucleotide A of the first ATG encoding the 
CYP3A4 protein has been taken as position 1) a T, at a position 
corresponding to any one of position 14357, 15753 or 25958 of the 
CYP3A4 gene (Accession No: AF280107, whereby the nucleotide A of 
the first ATG encoding the CYP3A4 protein has been taken as position 
1) a G, at a position corresponding to any one of position 14304 of the 
CYP3A4 gene (Accession No: AF280107, whereby the nucleotide A of 
the first ATG encoding the CYP3A4 protein has been taken as position 
1) a C or at a position corresponding to position 1229 of the CYP3A7 
gene (Accession No: gi4503232) a G; 

(e) a polynucleotide encoding an CYP3A4 polypeptide, wherein said 
polypeptide comprises an amino acid substitution at any one of position 
56, 130. 170, 174, 363, 373, 416 or 445 of the CYP3A4 polypeptide 
(Accession No: AF280107), wherein said substitution at a position 
corresponding to position 445 is not M to T; and 
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(f) a polynucleotide encoding an CYP3A4 or CYP3A7 polypeptide, wherein 
said polypeptide comprises an amino acid substitution of G to D at 
position 56, R to Q at position 130, V to I at position 170, D to H at 
position 174, T to M at position 363, L to F at position 373 or P to L at 
position 416 of the CYP3A4 polypeptide (Accession No: AF280107) or 
T to R at position 409 of the CYP3A7 polypeptide (Accession No: 
gi4503232). 



In the context of the present Invention the term "molecular variant" CYP3A4 or 
CYP3A7 gene or protein as used herein means that said CYP3A4 or CYP3A7 gene 
or protein differs from the wild type CYP3A4 or CYP3A7 gene or protein by way of 
nucleotide substitution(s), addition(s) and/or deletion(s) (Genomic sequences of the 
CYP3A4, CYP3A7 gene are described, for example Bork, J Biol Chem 264 (1989), 
910-9; 

Hashimoto, Eur J Biochem 218 (1993), 585-95; Beaune, Proc Natl Acad. Sci USA 83 
(1986), 8064-8; Malowa, Proc Natl Acad Sci U S A 83 (1986), 5311-5; Accession 
numbers: Ml 4096, J04449, XI 2387, M18907. The numbering of the polymorphisms 
refers to the sequence Ml 4096; for CYP3A7: the reference sequence is described in 
Komori, J Biochem (Tokyo) 105 (1989), 161-3; Accession number: gi4503232. 
Preferably, said nucleotide substitution(s) result(s) in a corresponding change in the 
amino acid sequence of the CYP3A4 or CYP3A7 protein. 

The term "corresponding" as used herein means that a position is not only 
determined by the number of the preceding nucleotides and amino acids, 
respectively. The position of a given nucleotide or amino acid in accordance with the 
present invention which may be deleted, substituted or comprise one or more 
additional nucleotide(s) may vary due to deletions or additional nucleotides or amino 
acids elsewhere in the gene or the polypeptide. Thus, under a "corresponding 
position" In accordance with the present invention it is to be understood that 
nucleotides or amino acids may differ in the indicated number but may still have 
similar neighboring nucleotides or amino acids. Said nucleotides or amino acids 
which may be exchanged, deleted or comprise additional nucleotides or amino acids 
are also comprised by the term "corresponding position". Said nucleotides or amino 
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acids may for instance together with their neighbors form sequences which may be 
involved in the regulation of gene expression, stability of the corresponding RNA or 
RNA editing, as well as encode functional domains or motifs of the protein of the 
invention. 

In accordance with the present invention, the mode and population distribution of 
novel so far unidentified genetic variations in the CYP3A4 and CYP3A7 gene have 
been analyzed by sequence analysis of relevant regions of the human CYP3A4 and 
CYP3A7 genes from many different individuals. It is a well known fact that genomic 
DNA of individuals, which harbor the individual genetic makeup of all genes, 
including CYP3A4 and CYP3A7 can easily be purified from individual blood samples. 
These individual DNA samples are then used for the analysis of the sequence 
composition of the CYP3A4 and CYP3A7 gene alleles that are present in the 
individual which provided the blood sample. The sequence analysis was carried out 
by PGR amplification of relevant regions of the CYP3A4 and CYP3A7 genes, 
subsequent purification of the PGR products, followed by automated DNA 
sequencing with established methods (ABI dyeterminator cycle sequencing). 
One important parameter that had to be considered in the attempt to determine the 
individual GYP3A4 and/or GYP3A7 genotype and identify novel GYP3A4 or GYP3A7 
variants by direct DNA-sequencing of PGR-products from human blood genomic 
DNA is the fact that each human harbors (usually, with very few abnormal 
exceptions) two gene copies of each autosomal gene (diploidy). Because of that, 
'great care had to be taken in the evaluation of the sequences to be able to identify 
unambiguously not only homozygous sequence variations but also heterozygous 
variations. The details of the different steps in the identification and characterization 
of novel GYP3A4 and GYP3A7 gene polymorphisms (homozygous and 
heterozygous) are described in the examples below. 

Sequence data for some of the mutations in the GYP3A4 and GYP3A7 genes 
detected in accordance with the present invention are illustrated in Figure 4 
(indicated by an arrow). The methods of the mutation analysis followed standard 
protocols and are described in detail in the examples. In general such methods to be 
used in accordance with the present invention for evaluating the phenotypic spectrum 
as well as the overlapping clinical characteristics with other forms of drug 
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metabolization and altered tolerance to drugs in patients with mutations in the 
CYP3A4 or CYP3A7 gene encompass for example haplotype analysis, single-strand 
conformation polymorphism analysis (SSCA). PGR and direct sequencing. On the 
basis of thorough clinical characterization of many patients the phenotypes can then 
be correlated to these mutations as well as to mutations that had , been described 
earlier. 



As is evident to the person skilled in the art this new molecular genetic knowledge 
can now be used to exactly characterize the genotype of the index patient and of his 
family where a given drug takes an unusual effect. 

Over the past 20 years, genetic heterogeneity has been increasingly recognized as a 
significant source of variation in drug response. Many scientific communications 
(Meyer, Ann. Rev. Pharmacol. Toxicol. 37 (1997), 269-296 and West, J. Clin. 
Pharmacol. 37 (1997), 635-648) have clearly shown that some drugs work better or 
may even be highly toxic in some patients than in others and that these variations in 
patient's responses to drugs can be related to molecular basis. This 
"pharmacogenomic" concept spots correlations between responses to drugs and 
genetic profiles of patient's (Marshall, Nature Biotechnology, 15 (1997), 954-957; 
Marshall, Nature Biotechnology, 15 (1997), 1249-1252). 

In this context of population variability with regard to drug therapy, 
pharmacogenomics has been proposed as a tool useful in the identification and 
selection of patients which can respond to a particular drug without side effects. This 
identification/selection can be based upon molecular diagnosis of genetic 
polymorphisms by genotyping DNA from leukocytes in the blood of patient, for 
example, and characterization of disease (Bertz, Clin. Pharmacokinet. 32 (1997), 
210-256; Engel. J. Chromatogra. B, Biomed. Appl. 678 (1996), 93-103). For the 
providers of health care, such as health maintenance organizations in the US and 
government public health services in many European countries, this 
pharmacogenomics approach can represent a way of both improving health care and 
reducing overheads because there is a large cost to unnecessary therapies, 
ineffective drugs and drugs with side effects. 
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The mutations in the variant CYP3A4 and CYP3A7 genes sometime result in amino 
acid deletion(s), insertion(s) and in particular in substitution(s) either alone or in 
combination. It is of course also possible to genetically engineer such mutations in 
wild type genes or other mutant forms. Methods for introducing such modifications in 
the DNA sequence of CYP3A4 or CYP3A7 gene are well known to the person skilled 
in the art; see, e.g., Sambrook, Molecular Cloning A Laboratory Manual, Cold Spring 
Harbor Laboratory (1989) N.Y. 

In a preferred embodiment of the invention, the above described polynucleotide 
encodes a variant CYP3A4 or CYP3A7 protein or fragment thereof, e.g., comprising 
one or more epitopes of the amino acid sequence encoded by SEQ ID NOS: 125, 
135, 139, 145, 147, 149. 155. 158 or 160. 

For the investigation of the nature of the alterations in the amino acid sequence of 
the CYP3A4 and CYP3A7 proteins computer programs may be used such as 
BRASMOL that are obtainable from the Internet. Furthermore, folding simulations 
and computer redesign of structural motifs can be performed using other appropriate 
computer programs (Olszewski, Proteins 25 (1996). 286-299; Hoffman, Comput. 
Appl. Biosci. 11 (1995), 675-679). Computers can be used for the conformational and 
energetic analysis of detailed protein models (Monge, J. Mol. Biol. 247 (1995), 995- 
1012; Renouf, Adv. Exp. Med. Biol. 376 (1995), 37-45). These analysis can be used 
for the identification of the influence of a particular mutation on binding and/or 
metabolization of drugs. 

Usually, said amino acid deletion, addition or substitution in the amino acid sequence 
of the protein encoded by the polynucleotide of the invention is due to one or more 
nucleotide substitution, insertion or deletion, or any combinations thereof. Preferably 
said nucleotide substitution, insertion or deletion results in an amino acid substitution 
of Gly56 to Asp, Arg130 to Gin, Val170 to lie, Asp174 to His. Thr363 to Met, Leu373 
to Phe or Pro416 to Leu in the CYP3A4 gene and/or Thr409 to Arg in exon 1 1 of the 
CYP3A7 gene. 

The polynucleotide of the Invention may further comprise at least one nucleotide and 
optionally amino acid deletion, addition and/or substitution other than those specified 
hereinabove, for example those described in the prior art; e.g.,(13). This embodiment 
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of the present invention allows the study of synergistic effects of the mutations in the 
CYP3A4 or CYP3A7 gene on the pharmalogical profile of drugs in patients who bear 
such mutant forms of the gene or similar mutant forms that can be mimicked by the 
above described proteins. It is expected that the analysis of said synergistic effects 
provides deeper insights into drug tolerant or sensitive phenotypes of certain forms of 
cancer and other diseases. From said deeper insight the development of diagnostic 
and pharmaceutical compositions related to cancer will greatly benefit. 

Thus, in a preferred embodiment, the present Invention relates to polynucleotides of 
molecular variant CYP3A4 and CYP3A7 genes, wherein the nucleotide deletion, 
addition and/or substitution result in altered expression of the variant CYP3A4 or 
CYP3A7 gene compared to the corresponding wild type gene. 

The polynucleotide of the invention may be, e.g., DNA, cDNA, genomic DNA, RNA or 
synthetically produced DNA or RNA or a recombinantly produced chimeric nucleic 
acid molecule comprising any of those polynucleotides either alone or in 
combination. Preferably said polynucleotide is part of a vector, particularly plasmids, 
cosmids, viruses and bacteriophages used conventionally in genetic engineering that 
comprise a polynucleotide of the invention. Such vectors may comprise further genes 
such as marker genes which allow for the selection of said vector in a suitable host 
cell and under suitable conditions. 

In a further preferred embodiment of the vector of the invention, the polynucleotide of 
the invention is operativeiy linked to expression control sequences allowing 
expression in prokaryotic or eukaryotic cells. Expression of said polynucleotide 
comprises transcription of the polynucleotide, preferably into a translatable mRNA. 
Regulatory elements ensuring expression in eukaryotic cells, preferably mammalian 
cells, are well known to those skilled in the art. They usually comprise regulatory 
sequences ensuring Initiation of transcription and optionally poly-A signals ensuring 
termination of transcription and stabilization of the transcript. Additional regulatory 
elements may include transcriptional as well as translational enhancers. Possible 
regulatory elements permitting expression in prokaryotic host cells comprise, e.g., the 
lac, trp or tac promoter in E. coli, and examples for regulatory elements permitting 
expression in eukaryotic host cells are the AOX1 or GAL1 promoter in yeast or the 
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CMV-, SV40- , RSV-promoter (Rous sarcoma virus), CMV-enhancer, SV40-enhancer 
or a globin intron in mammalian and other animal cells. Beside elements which are 
responsible for the initiation of transcription such regulatory elements may also 
comprise transcription termination signals, such as the SV40-poly-A site or the tk- 
poIy-A site, downstream of the polynucleotide. In this context, suitable expression 
vectors are known in the art such as Okayama-Berg cDNA expression vector pcDVI 
(Pharmacia). pCDM8, pRc/CMV, pcDNAI, pcDNA3 (In-vitrogene), pSPORTI 
(GIBCO BRL). Preferably, said vector is an expression vector and/or a gene transfer 
or targeting vector. Expression vectors derived from viruses such as retroviruses, 
vaccinia virus, adeno-associated virus, herpes viruses, or bovine papilloma virus, 
may be used for delivery of the polynucleotides or vector of the invention into 
targeted cell population. Methods which are well known to those skilled in the art can 
be used to construct recombinant viral vectors; see, for example, the techniques 
described in Sambrook, Molecular Cloning A Laboratory Manual, Cold Spring Harbor 
Laboratory (1989) N.Y. and Ausubel, Current Protocols in Molecular Biology, Green 
Publishing Associates and Wiley Interscience, N.Y. (1994). Alternatively, the 
polynucleotides and vectors of the invention can be reconstituted into liposomes for 
delivery to target cells. 

The present invention furthermore relates to host cells transformed with a 
polynucleotide or vector of the invention. Said host cell may be a prokaryotic or 
eukaryotic cell; see supra. The polynucleotide or vector of the invention which is 
present In the host cell may either be integrated into the genome of the host cell or it 
may be maintained extrachromosomally. In this respect, it is also to be understood 
that the recombinant DNA molecule of the invention can be used for "gene targeting" 
and/or "gene replacement", for restoring a mutant gene or for creating a mutant gene 
via homologous recombination; see for example Mouellic, Proc. Natl. Acad. Sci. USA, 
87 (1990), 4712-4716; Joyner, Gene Targeting, A Practical Approach, Oxford 
University Press. 

The host cell can be any prokaryotic or eukaryotic cell, such as a bacterial, insect, 
fungal, plant, animal or human cell. Preferred fungal cells are, for example, those of 
the genus Saccharomyces, In particular those of the species S. cerevisiae. The term 
"prokaryotic" is meant to include all bacteria which can be transformed or transfected 
with a polynucleotide for the expression of a variant CYP3A4 or CYP3A7 protein or 
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fragment thereof. Prokaryotic hosts may include gram negative as well as gram 
positive bacteria such as, for example, E. coli, S. typhimurium, Serratia marcescens 
and Bacillus subtills. A polynucleotide coding for a mutant form of CYP3A4 and 
CYP3A7 variant proteins can be used to transform or transfect the host using any of 
the techniques commonly known to those of ordinary skill in the art. Methods for 
preparing fused, operably linked genes and expressing them in bacteria or animal 
cells are well-known in the art (Sambrook, supra). The genetic constructs and 
methods described therein can be utilized for expression of variant CYP3A4 and 
CYP3A7 proteins in, e.g., prokaryotic hosts. In general, expression vectors 
containing promoter sequences which facilitate the efficient transcription of the 
inserted polynucleotide are used in connection with the host. The expression vector 
typically contains an origin of replication, a promoter, and a terminator, as well as 
specific genes which are capable of providing phenotypic selection* of the 
transformed cells. The transformed prokaryotic hosts can be grown in fermentors and 
cultured according to techniques known in the art to achieve optimal cell growth. The 
proteins of the invention can then be isolated from the grown medium, cellular 
lysates, or cellular membrane fractions. The isolation and purification of the 
microbially or othenA/ise expressed polypeptides of the invention may be by any 
conventional means such as, for example, preparative chromatographic separations 
and immunological separations such as those involving the use of monoclonal or 
polyclonal antibodies. 

Thus, in a further embodiment the invention relates to a method for the production of 
variant CYP3A4 and CYP3A7 proteins and fragments thereof comprising culturing a 
host cell as defined above under conditions allowing the expression of the protein 
and recovering the produced protein or fragment from the culture. 

In another embodiment the present invention relates to a method for producing cells 
capable of expressing a variant CYP3A4 and/or CYP3A7 gene comprising 
genetically engineering cells with the polynucleotide or with the vector of the 
invention. The cells obtainable by the method of the invention can be used, for 
example, to test drugs according to the methods described in Sambrook, Fritsch, 
Maniatis (1989). Molecular cloning: a laboratory manual. Cold Spring Harbour 
Laboratory press, Cold Spring Harbour; Peyronneau, Eur J Biochem 218 (1993), 
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355-61 ; Yamazaki, Carcinogenesis 16 (1995). 2167-2170. Furthermore, the cells can 
be used to study known drugs and unknown derivatives thereof for their ability to 
complement loss of drug efficacy caused by mutations in the CYP3A4 or CYP3A7 
gene. For these embodiments the host cells preferably lack a wild type allele, 
preferably both alleles of the CYP3A4 and/or CYP3A7 gene and/or have at least one 
mutated from thereof. Alternatively, strong overexpression of a mutated allele over 
the normal allele and comparison with a recombinant cell line overexpressing the 
normal allele at a similar level may be used as a screening and analysis system. The 
cells obtainable by the above-described, method may also be used for the screening 
methods referred to herein below. 

Furthermore, the Invention relates to variant CYP3A4 and CYP3A7 proteins and 
fragments thereof encoded by a polynucleotide according to the invention or 
obtainable by the above-described methods or from cells produced by the method 
described above. In this context it is also understood that the variant CYP3A4 and 
CYP3A7 proteins according to the invention may be further modified by conventional 
methods known in the art. By providing the variant CYP3A4 and CYP3A7 proteins 
according to the present invention it is also possible to determine the portions 
relevant for their biological activity or inhibition of the same. 

The present invention furthermore relates to antibodies specifically recognizing a 
variant CYP3A4 or CYP3A7 protein according to the invention. Advantageously, the 
antibody specifically recognizes an epitope containing one or more amino acid 
substitution(s) as defined above 

Antibodies against the variant CYP3A4 or CYP3A7 protein of the invention can be 
prepared by well known methods using a purified protein according to the invention 
or a (synthetic) fragment derived therefrom as an antigen. Monoclonal antibodies can 
be prepared, for example, by the techniques as originally described in Kohler and 
Milstein. Nature 256 (1975), 495, and Galfre, Meth. Enzymol. 73 (1981), 3, which 
comprise the fusion of mouse myeloma cells to spleen cells derived from immunized 
mammals. The antibodies can be monoclonal antibodies, polyclonal antibodies or 
synthetic antibodies as well as fragments of antibodies, such as Fab, Fv or scFv 
fragments etc. Furthermore, antibodies or fragments thereof to the aforementioned 
polypeptides can be obtained by using methods which are described, e.g., in Harlow 



1 
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and Lane "Antibodies, A Laboratory Manual", CSH Press, Cold Spring Harbor, 1988. 
TInese antibodies can be used, for example, for the immunoprecipitation and 
Immunolocalization of the variant CYP3A4 and CYP3A7 proteins of the Invention as 
well as for the monitoring of the presence of such variant CYP3A4 and CYP3A7 
proteins, for example, In transgenic organisms, and for the identification of 
compounds interacting with the proteins according to the invention. For example, 
surface plasmon resonance as employed in the BIAcore system can be used to 
increase the efficiency of phage antibodies which bind to an epitope of the protein of 
the invention (Schier, Human Antibodies Hybridomas 7 (1996), 97-105; Malmborg, J. 
Immunol. Methods 183 (1995), 7-13). 

Furthermore, the present invention relates to nucleic acid molecules which represent 
or comprise the complementary strand of any of the above described polynucleotides 
or a part thereof, thus comprising at least one nucleotide difference compared to the 
corresponding wild type CYP3A4 and CYP3A7 gene nucleotide sequences specified 
by the above described nucleotide substitutions, deletions and additions. Such a 
molecule may either be a deoxyribonucleic acid or a ribonucleic acid. Such 
molecules comprise, for example, antisense RNA. These molecules may furthermore 
be linked to sequences which when transcribed code for a ribozyme thereby 
producing a ribozyme which specifically cleaves transcripts of polynucleotides 
according to the invention. 

Furthermore, the present invention relates to a vector comprising a nucleic acid 
molecule according to the invention. Examples for such vectors are described above. 
Preferably, the nucleic acid molecule present in the vector is operatively linked to 
regulatory elements permitting expression in prokaryotic or eukaryotic host cells; see 
supra. 

The present invention also relates to a method for the production of a transgenic non- 
human animal, preferably transgenic mouse, comprising introduction of a 
polynucleotide or vector of the Invention into a germ cell, an embryonic cell, stem cell 
or an egg or a cell derived therefrom. The non-human animal can be used in 
accordance with the method of the invention described below and may be a non- 
transgenic healthy animal, or may have a disorder, preferably a disorder caused by 
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at least one mutation in the CYP3A4 and/or CYP3A7 gene. Such transgenic animals 
are well suited for, e.g., pharmacological studies of drugs in connection with variant 
forms of the above described variant CYP3A4 and CYP3A7 proteins since these 
proteins or at least their functional domains are conserved between species in higher 
eukaryotes, particularly in mammals. Production of transgenic embryos and 
screening of those can be perforrried, e.g., as described by A. L. Joyner Ed., Gene 
Targeting, A Practical Approach (1993), Oxford University Press. The DNA of the 
embryos can be analyzed using, e.g.. Southern blots with an appropriate probe. 

The invention also relates to transgenic non-human animals such as transgenic 
mouse, rats, hamsters, dogs, monkeys, rabbits, pigs, C. elegans and fish such as 
) torpedo fish comprising a polynucleotide or vector of the Invention or obtained by the 
method described above, preferably wherein said polynucleotide or vector is stably 
integrated into the genome of said non-human animal, preferably such that the 
presence of said polynucleotide or vector leads to the expression of the variant 
CYP3A4 and/or CYP3A7 gene of the invention. It may have one or several copies of 
the same or different polynucleotides of the variant CYP3A4 or CYP3A7 gene. This 
animal has numerous utilities, including as a research model for drug tolerability and 
therefore, presents a novel and valuable animal in the development of therapies, 
treatment, etc. for diseases caused by deficiency or failure of drug metabolization in 
the cell. Accordingly, in this instance, the mammal is preferably a laboratory animal 
such as a mouse or rat. 

Preferably, the transgenic non-human animal of the invention further comprises at 
least one inactivated wild type allele of the CYP3A4 and/or CYP3A7 gene. This 
embodiment allows for example the study of the interaction of various variant forms 
of CYP3A4 and CYP3A7 proteins. It might be also desirable to inactivate CYP3A4 
and/or CYP3A7 gene expression or function at a certain stage of development and/or 
life of the transgenic animal. This can be achieved by using, for example, tissue 
specific, developmental and/or cell regulated and/or inducible promoters which drive 
the expression of, e.g., an antlsense or ribozyme directed against the RNA transcript 
of the CYP3A4 or CYP3A7 gene; see also supra. A suitable inducible system is for 
example tetracycline-regulated gene expression as described, e.g., by Gossen and 
Bujard (Proc. Natl. Acad. Sci. 89 USA (1992), 5547-5551) and Gossen et at. (Trends 
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Biotech. 12 (1994), 58-62). Similar, the expression of the variant CYP3A4 and 
CYP3A7 gene may be controlled by such regulatory elements. 

With the variant CYP3A4 and CYP3A7 polynucleotides and proteins and vectors of 
the invention, it is now possible to study in vivo and in vitro the efficiency of drugs in 
relation to particular mutations in the CYP3A4 or CYP3A7 gene of a patient and the 
affected phenotype. Furthermore, the variant CYP3A4 and CYP3A7 proteins of the 
invention can be used to determine the pharmacological profile of drugs and for the 
identification and preparation of further drugs which may be more effective for the 
treatment of, e.g., cancer, in particular for the amelioration of certain phenotypes 
caused by the respective mutations such as those described above. 

Thus, a particular object of the present invention concerns drug/pro-drug selection 
and formulation of pharmaceutical compositions for the treatment of diseases which 
are amenable to chemotherapy taking into account the polymorphism of the variant 
form of the CYP3A4 or CYP3A7 gene that cosegregates with the affected phenotype 
of the patient to be treated. This allows the safe and economic application of drugs 
which for example were hitherto considered not appropriate for therapy of, e.g., 
cancer due to either their side effects In some patients and/or their unreliable 
pharmalogical profile with respect to the same or different phenotype(s) of the 
disease. The . means and methods described herein can be used, for example, to 
Improve dosing recommendations and allows the prescriber to anticipate necessary 
dose adjustments depending on the considered patient group. 

In a further embodiment the present invention relates to a method of identifying and 
obtaining an CYP3A4 or CYP3A7 inhibitor capable of modulating the activity of a 
molecular variant of the CYP3A4 or CYP3A7 gene or its gene product comprising the 
steps of 

(a) contacting the variant CYP3A4 or CYP3A7 protein or a cell expressing a 
molecular variant gene comprising a polynucleotide of the invention in the 
presence of components capable of providing a detectable signal in response 
to drug metabolization, with a compound to be screened under conditions to 
permit CYP3A4 or CYP3A7 mediated drug metabolization, and 
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(b) 



detecting the presence or absence of a signal or increase of a signal 
generated from the metabolized drug, wherein the presence or increase of the 
signal is indicative for a putative inhibitor. 

The term "compound" in a method of the invention includes a single substance or a 
plurality of substances which may or may not be identical. 

Said compound(s) may be chemically synthesized or produced via microbial 
fermentation but can also be comprised In, for example, samples, e.g., cell extracts 
from, e.g., plants, animals or microorganisms. Furthermore, said compounds may be 
known in the art but hitherto not known to be useful as an inhibitor, respectively. The 
plurality of compounds may be, e.g., added to the culture medium or injected into a 
cell or non-human animal of the invention. 

If a sample containing (a) compound(s) is identified in the method of the invention, 
then it is either possible to isolate the compound from the original sample identified 
as containing the compound, in question or one can further subdivide the original 
sample, for example, if it consists of a plurality of different compounds, so as to 
reduce the number of different substances per sample and repeat the method with 
the subdivisions of the original sample. It can then be determined whether said 
sample or compound displays the desired properties, for example, by the methods 
described herein or in the literature (e.g. (13) and Lehmann, J Clin Invest 102 (1998), 
1016-23). Depending on the complexity of the samples, the steps described above 
can be performed several times, preferably until the sample identified according to 
the method of the invention only comprises a limited number of or only one 
substance(s). Preferably said sample comprises substances of similar chemical 
and/or physical properties, and most preferably said substances are identical. The 
methods of the present invention can be easily performed and designed by the 
person skilled in the art, for example in accordance with other cell based assays 
described in the prior art or by using and modifying the methods as described herein. 
Furthermore, the person skilled in the art will readily recognize which further 
compounds and/or enzymes may be used in order to perform the methods of the 
invention, for example, enzymes, if necessary, that convert a certain compound into 
the precursor which in turn represents a substrate for the CYP3A4 or GYP3A7 
protein. Such adaptation of the method of the invention is well within the skill of the 
person skilled in the art and can be performed without undue experimentation. 
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d bed. ,or example, in Hashinco, Eur J Blochem 2,8 (1993), 585-95 wherein 
™,on assays wlth chimeric CYP3A4 genes in HepG2 ce„s are described. 
S,m,larly, the vanant CYP3A4 and/or CYP3A7 genes can be expressed or co- 
expressed in HepG2 celis and analyzed for .heir transcriptlonai activity and catalytic 
pr pert,es o, CyP3A4 cr CYP3A7. Such an assay can also be used ,or studying 1 
oa,aly„c properties o, the CYP3A4 and CYP3A7 on its substrates such as steroids 
(testosterone, progesterone, androstenedione, Cortisol, 17B-oestradiol 17a- 
ethynyloestradioi), antibiotics (erythromycin), immunosuppressive (cyclosponne A) 
benzod,azepine (midazolam), benzothiazepine derivatives (diltiazem, triazolam) and 
n.fed,p,ne. In particular, such tests are useful to add in predicting whether a given 
drug w. ,n,eract in an individual carding the respective variant CYP3A4 and/or 

Ih !bl!T' ^TT ^ --^^-ce 

wrth above descnbed methods o, the present Invention is also described in (22) m 
add,„on heterologous expression systems such as yeas, can be used in o de ,o 
study ,he s.abili,y, binding properties and cata^ic activities o, ,he gene products o, 
the vanant CYP3A4 and CYP3A7 genes compared to the corresponding wild t^e 
gene product. As mentioned before, the molecular variant CYP3A4 and CYP3A7 
genes and their gene products, particularly when employed in the above described 
2*ods, can be used tor pharmacological and ,oxico,ogical s,udies o, ,he 
me.abol,sm Of drugs. Preferred drugs ,o be tested in accordance wi,h the methods of 

l.m,ted ,0 n„ed,p,ne, erythromycin, troleandomydn, quinidlne, cyclosporin A ,7 a- 
ethynyleslradiol, lidocaine, diltiazem. dexamethasone, RU486, see also supra ' 
Cornpounds which can be used in accordance wi,h me present invention 'include 

pep.,dom,me„cs, PNAs and the like. Said compounds can also be fun tJi 

I °' - ^-escribed a 

.ho e l-r'n T'^" ''"'^'^ ^-wn .0 

.hose s«ed ,n .he art and are described In, for example. Beilsfein, Handbook of 

N^Y. 100,0 U.S.A. and Organic Synlhesis, Wiley, New York, USA. Furthermore said 
denva.«es and analogues can be tested for their effects according to me*," 



known in the art or as described. Furthermore, peptide mimetics and/or computer 
aided design of appropriate drug derivatives and analogues can be used for 
example, according to the methods described below. Such analogs comprise 
molecules having as the basis structure of known CYP3A4 and CYP3A7-substrates 
and/or inhibitors and/or modulators; see infra. 

Appropriate computer programs can be used for the identification of interactive sites 
of a putative inhibitor and the CYP3A4 or CYP3A7 protein of the invention by 
computer assistant searches for complementary structural motifs (Fassina 
Immunomethods 5 (1994), 114-120). Further appropriate computer systems for the 
computer aided design of protein and peptides are described in the prior art for 
example, in Berry, Biochem. Soc. Trans. 22 (1994). 1033-1036; Wodak. Ann N Y 
Acad. Sci. 501 (1987), 1-13; Pabo. Biochemistry 25 (1986), 5987-5991. The results 
Obtained from the above-described computer analysis can be used in combination 
with the method of the invention for. e.g.. optimizing known inhibitors. Appropriate 
peptidomimetics and other inhibitors can also be identified by the synthesis of 
peptidomimetic combinatorial libraries through successive chemical modification and 
testing the resulting compounds, e.g., according to the methods described herein 
Methods for the generation and use of peptidomimetic combinatorial libraries are 
described in the prior art. for example in Ostresh, Methods in Enzymology 267 
(1996). 220-234 and Domer, Bioorg. Med. Chem. 4 (1996), 709-715. Furthermore 
the three-dimensional and/or crystallographic structure of inhibitors and the CYP3A4 
or CYP3A7 protein of the invention can be used for the design of peptidomimetic 
drugs (Rose, Biochemistry 35 (1996), 1.2933-12944; Rutenberg. Bioorg. Med Chem 
4(1996), 1545-1558). 

In summary, the present invention provides methods for identifying and obtaining 
compounds which can be used in specific doses for the treatment of specific forms of 
diseases, e.g., cancer the chemotherapy of which is complicated by malfunctions of 
the CYP3A4 or CYP3A7 gene often resulting in an altered activity or level of drug 
metabolization or sensitive phenotype. 

In a preferred embodiment of the method of the Invention said cell is a cell of or 
Obtained by the method of the Invention or is comprised in the above-described 
transgenic non-human animal. 
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In a further embodiment the present invention relates to a method of identifying and 
Obtaining an CYP3A4 or CYP3A7 inhibitor oapable of modulating the activity of a 
molecular vadant of the CYP3A4 or CYP3A7 gene or its gene product comprising the 
Steps of ^ 

(a) contacting the variant CYP3A4 or CYP3A7 protein of the invention with a first 
molecule known to be bound by CYP3A4 or CYP3A7 protein to form a first 
complex of said protein and said first molecule; 

(b) contacting said first complex with a compound to be screened- and 

(c) measuring whether said compound displaces said first molecule from said first 
complex. - - 

Advantageously, in said method said measuring step comprises measuring the 
formation of a second complex of said protein and said inhibitor candidate 
Preferably, sa,d measuring step comprises measuring the amount of said first 
molecule that is not bound to said protein. 

in a particularly preferred embodiment of the above-described method of said first 
molecule is nifedipine, rifampioine or corticosterone. Furthermore, it is preferred that 
in the method of the invention said first molecule is labeled, e.g., with a radioactive or 
fluorescent label. 

In a still further embodiment the present invention relates to a method of diagnosing 
a disorder related to the presence of a molecular variant CYP3A4 or CYP3A7 gene 
or susceptibility to such a disorder comprising 

(a) determining the presence of a polynucleotide of the invention in a sample from 
a subject; and/or 

(b) determining the presence of a variant fom, of CYP3A4 or CYP3A7 protein for 
example, with the antibody of the invention. 

In accordance with this embodiment of the present invention, the method of testing 
the status of a disorder or susceptibility to such a disorder can be effected by using a 
polynucleotide or a nucleic acid molecule of the invention, e.g., in the form of a 
Southern or Northern blot or In situ analysis. Said nucleic acid sequence may 



hybnd,ze to a coding region of either of the genes or to a non-coding region e a 
■ntro. ,n the case that a oon^ple.enta^ sequence Is employed in the Ithod of he 
.nvent,on said nucieic acid .olecuie can again be used in No«hern bo s 
Add .onai^, said testing can be done in coniunCon with an actuai bioCng. e Tl 

Furthermore, a pnmer or oligonucleotide can also be used for hybridizing to one of 
me above mentioned CYP3A4 or cyP3A7 genes or corresponding mRNAs. The 
nucie,c acds used for hybridization can, of course, be conveniently labeled by 

IT ? °' ' """^^""^ °' "^"^'^ are we 

known ,n the art. The labeling of said nucleic acid molecules can be effected by 

conventional methods. 

Additionally, the presence or expression of variant CYP3A4 and CYP3A7 genes can 
be monitored by using a primer pair that specifically hybridizes to either of the 
corresponding nucleic acid sequences and by carrying out a PGR reaction according 
to standard procedures. Specific hybridlzaUon of the above mentioned probes or 
pnmers preferably occurs a. stringent hybridization cond«lons. The term -stringent 

Motecular C,on,ng, A Laboratory Manual" second ed., CSH Press, Cold Spring 

Higgins eds., IRL Press, Oxford, 1985. Furthemiore, the mRNA, cRNA, cDNA or 
9enom,c DNA obtained from the subiec, may be sequenced to identify mutals 
Which may be characteristic fingerprints of mutations in the CYP3A4 and CYP3A7 
gene^The present Invention further comprises methods wherein such a fingerprint 

Z : rTT -"J-' ~ly the 

inl rt PMA^r *° ^"^^ - Xn-n 

n th art^ RNA fingerprints may be pertonned by, for example, digesting an RNA 

sample obtained from the subjeC with a suitable RNA-Enzyme, for example RNase 
T„ RNase T. or the like or a ribozyme and, for example, eleotrophoretically 
separating and detecting the RNA fragments as deechbed above 
Further modifications of the above-mentioned embodiment of the invention can be 

fr m tf^^sT, ' experimentation 
from this disclosure; see, e.g., the examples. An additional embodiment of the 
present invention relates to a method wherein said determination is effected by 
employing an antibody of m invenUon or fragment thereof. The antibody used in the 



method of the invention may be labeled with deteotable tags such ae a histidine flags 
or a biotin molecule. 

In a preferred embodiment of the present invention, the above described methods 
comprise PGR. ligase chain reaction, restriction digestion, direct sequencing, nucleic 
acd amplification techniques, hybridization techniques or immunoassays (Sambrook 
et al., loc. cit. CSH cloning, Harlow and Lane loc. cit. CSH antibodies). 

In a preferred embodiment of the method of the present invention said disorder Is 



cancer, 



In a further embodiment of the above-described method, a further step comprising 
administering to the subject a medicament to abolish or alleviate said variations in 
the CYP3A4 or CYP3A7 gene in accordance with all applications of the method of 
the invention allows treatment of a given disease before the onset of clinical 
symptoms due to the phenotype response caused by the CYP3A4 or CYP3A7 gene. 

In a preferred embodiment of the method of the invention said medicament are 
chemotherapeutic agents such as substrates of CYP3A4: paclitaxen (Eur J Drug 
Metab Pharmacokinet 23 (1998), 417-24), tamoxifen and toremifene (Drug Metab 
D^pos 27(1999), 681-8; Clin Pharmacol Ther 64 (1998), 648-54; Clin Pharmacol 
Ther 57 (1995), 628-35), trofosfamide (Cancer Chemother Pharmacol 44(1999) 327 
334). cyclophosphamide and ifosfamide (Drug Metab Dispos 27 (1999) 655-66- 
Cancer Res 58 (1998). 4391-401; Br J Clin Pharmacol 40 (1995). 523-30)' taxotere 
(Pharmacogenetics 8 (1998). 391-401; Clarke. Clin Pharmacokinet 36 (1999) 99- 
114). 

In another preferred embodiment of the above-described methods, said method 
further comprises introducing 

(i) a functional and expressible wild type CYP3A4 or CYP3A7 gene or 

(ii) a nucleotide acid molecule or vector of the invention into cells. 

In this context and as used throughout this specification, "functional" CYP3A4 and 
CYP3A7 gene means a gene wherein the encoded protein having part or all of the 



pnmary structural conformation of the wild typo CYP3A4 and CYP3A7 protein I e 
po^essing the bioiogica, property of metabolizing drugs and controlling the CYP3A4' 
CYP3A7 gene, respectively. This embodiment of the present invention is suited fo^ 

CYP3A4 and/or CYP3A7 gene would allow the conclusion that said expression is 
interrelate! to the generation or maintenance of a corresponding phenoCprJthe 

llT' ' '''"'^ *^ -P--"" 'o low 

I vels or abo„sh the same. This can be done, for example, by at leas, partial 

e ,m,na.,on o, the expression of the mutant gene by biolcgica, means, for exam^e.T 

the use Of nbczymes, antisense nucleic acid molecules, intracellular antibodies or the 

above described Inhibitors against the variant forms of these CYP3A4 and/or 

CYP3A7 prote,ns. Furthermore, pham,aceutical products may be developed that 

reduce the expression levels of the corresponding mutant proteins and genes. 

In a further embodiment the invention relates to a method for the production of a 
Pham,aceuticai composition comprising the steps of any one of the above described 
methods and synthesizing and/or fomiulating the compound identified In step (b) or a 
derivative or homologue thereof in a pham,aceu.ical,y acceptable ton. The 
therapeutically useful compounds identified according to the method o, the invention 
may be formulated and administered to a pa.len, as discussed above. For uses and 
therapeutic doses detem^ined to be appropriate by one skilled in the art see infra. 

Pharmaceutical composition comprising the steps of the above-described methods- 
and formulating a drug or pro-drug in the form suitable for therapeutic application and 

iZt":' °' °' ''^'"^^ Of th 



Doigs or pro-drugs after their /„ vf.o administration are metabolized in order to be 
eliminated either by excrefcn or by metabolism to one or more active or inactive 
metal^ites (Meyer, J. Pham^acoMnet. Biophan.. 24 (1996), 449-459). Thus ramlr 
than using the actual compound or Inhibitor identified and obtained in accordance 
wi«„he methods o, the present Invention a corresponding formulation as a prolg 
can be used which is converted into Its active in the patient. Precautionary measure! 
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that may be taken for the application of pro-drugs and drugs are described in the 
literature; see, for review, Ozama, J. Toxicol. Sci. 21 (1996), 323-329). 

In a preferred embodiment of the method of the present invention said drug or 
prodrug is a derivative of a medicament as defined hereinbefore. 

In a still further embodiment the present invention relates to an inhibitor identified or 
obtained by the method described hereinbefore. Preferably, the inhibitor binds 
specifically to the variant CYP3A4 or CYP3A7 protein of the invention. The 
antibodies, nucleic acid molecules and inhibitors of the present invention preferably 
have a specificity at least substantially identical to binding specificity of the natural 
ligand or binding partner of the CYP3A4 or CYP3A7 protein of the invention. An 
antibody or inhibitor can have a binding affinity to the CYP3A4 or CYP3A7 protein of 
the invention of at least 10^ M'\ preferably higher than 10'' M'^ and advantageously 
up to 10^^ M'"" in case CYP3A4 or CYP3A7 activity should be repressed. Hence, in a 
preferred embodiment, a suppressive antibody or inhibitor of the invention has an 
affinity of at least about 10"'' M, preferably at least about 10"^ M and most preferably 
at last about lO'""^ M. 

Furthermore, the present invention relates to the use of an oligo- or polynucleotide 
for the detection of a polynucleotide of the invention and/or for genotyping of 
corresponding individual CYP3A4 or CYP3A7 alleles. Preferably, said oligo- or 
polynucleotide is a polynucleotide or a nucleic acid molecule of the invention 
described before. 

In a particular preferred embodiment said oligonucleotide is about 15 to 50, 
preferably 20 to 40, more preferably 20 to 30 nucleotides in length and comprises the 
nucleotide sequence of any one of SEQ ID NOS: 1 to 127, 140, 141 or 142 or a 
complementary sequence. 

Hence, in a still further embodiment, the present invention relates to a primer or 
probe consisting of an oligonucleotide as defined above. In this context, the term 
"consisting of" means that the nucleotide sequence described above and employed 
for the primer or probe of the invention does not have any further nucleotide 
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sequences of the CYP3A4 or CYP3A7 gene immediately adjacent at its 5' and/or 3' 
end. However, other moieties such as labels, e.g., blotin molecules, histidin flags, 
antibody fragments, colloidal gold, etc. as well as nucleotide sequences which do not 
correspond to the CYP3A4 or CYP3A7 gene may be present In the primer and 
probes of the present invention. Furthermore, it is also possible to use the above 
described particular nucleotide sequences and to combine them with other nucleotide 
sequences derived from the CYP3A4 or CYP3A7 gene wherein these additional 
nucleotide sequences are interspersed with moieties other than nucleic acids or 
wherein the nucleic acid does not correspond to nucleotide sequences of the 
CYP3A4 or CYP3A7 gene. Furthermore, it is evident to the person skilled in the art 
that the oligonucleotide can be modified, for example, by thio-phosphate-backbones 
and/or base analogs well known in the art (Flanagan, Proc. Natl. Acad. Sci. USA 96 
(1999), 3513-8; Witters, Breast Cancer Res. Treat. 53 (1999), 41-50;' Hawley, 
Antisense Nucleic Acid Drug Dev. 9 (1999), 61-9; Peng Ho, Brain Res. Mol. Brain 
Res. 62 (1998), 1-11; Spiller, Antisense Nucleic Acid Drug Dev. 8 (1998), 281-93; 
Zhang, J. Pharmacol. Exp. Ther. 278 (1996), 971-9; Shoji, Antimicrob. Agents 
Chemother. 40 (1996), 1670-5; Crooke, J. Pharmacol. Exp. Ther. 277 (1996), 923- 
37). 

In addition, the present invention relates to the use of an antibody or a substance 
capable of binding specifically to the gene product of an CYP3A4 or CYP3A7 gene 
for the detection of the variant CYP3A4 or CYP3A7 protein of the invention, the 
expression of a molecular variant CYP3A4 or CYP3A7 gene comprising a 
polynucleotide of the invention and/or for distinguishing CYP3A4 and CYP3A7 alleles 
comprising a polynucleotide of the invention. 

Moreover, the present invention relates to a composition, preferably pharmaceutical 
composition comprising the antibody, the nucleic acid molecule, the vector or the 
inhibitor of the present invention, and optionally a pharmaceutically acceptable 
carrier. These pharmaceutical compositions comprising, e.g., the inhibitor or 
pharmaceutically acceptable salts thereof may conveniently be administered by any 
of the routes conventionally used for drug administration, for instance, orally, 
topically, parenterally or by inhalation. Acceptable salts comprise acetate, 
methylester, HCI, sulfate, chloride and the like. The compounds may be administered 
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in conventional dosage forms prepared by combining the drugs with standard 
pharmaceutical carriers according to conventional procedures. These procedures 
may involve mixing, granulating and compressing or dissolving the ingredients as 
appropriate to the desired preparation. It will be appreciated that the form and 
character of the pharmaceutically acceptable character or diluent is dictated by the 
amount of active ingredient with which it is to be combined, the route of 
administration and other well-known variables. The carrier(s) must be "acceptable" in 
the sense of being compatible with the other ingredients of the formulation and not 
deleterious to the recipient thereof. The pharmaceutical carrier employed may be, for 
example, either a solid or liquid. Exemplary of solid carriers are lactose, terra alba, 
sucrose, talc, gelatin, agar, pectin, acacia, magnesium stearate. stearic acid and the 
like. Exemplary of liquid carriers are phosphate buffered saline solution, syrup, oil 
such as peanut oil and olive oil, water, emulsions, various types of wetting agents 
sterile solutions and the like. Similarly, the carrier or diluent may include time delay 
material well known to the art, such as glyceryl mono-stearate or glyceryl distearate 
alone or with a wax. 

The dosage regimen will be determined by the attending physician and other clinical 
factors; preferably in accordance with any one of the above described methods. As is 
well known in the medical arts, dosages for any one patient depends upon many 
factors, including the patient's size, body surface area, age, the particular compound 
to be administered, sex. time and route of administration, general health, and other 
drugs being administered concurrently. Progress can be monitored by periodic 
assessment. 

Furthermore, the use of pharmaceutical compositions which comprise antisense- 
oligonucleotides which specifically hybridize to RNA encoding mutated versions of a 
CYP3A4 or CYP3A7 gene according to the invention or which comprise antibodies 
specifically recognizing mutated CYP3A4 or CYP3A7 protein but not or not 
substantially the functional wild-type form is conceivable in cases in which the 
concentration of the mutated form in the cells should be reduced. 
Thanks to the present invention the particular drug selection, dosage regimen and 
corresponding patients to be treated can be determined in accordance with the 
present invention. The dosing recommendations will be indicated in product labeling 
by allowing the prescriber to anticipate dose adjustments depending on the 
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cons.dered patien. group, with information .ha. avoids prescribing the w,ong drug to 
the wrong patients at ttie wrong dose. 

Furthermore, the present inven.ion reia.es .0 a diagnostic composition or kit 
compnsmg any one of the afore^escribed poiynucleotides. vectors, host cells 
vanant CyP3A4 and CYP3A7 proteins, antibodies, inhibitors, nuCeic acid moieouies 

dltUr'''""'"' °' """""^"^ "'^"^ >°' 

The kit Of the invention may contain further ingredients such as selection markers 
and components for selective media suitable for the generation of transgenic cells 
and an,ma s. The kit o, the invention may advantageous^ be used for carding out a 
method Of the invention and cculd be, inter alia, employed in a variety of applications, 
e.g., ,n the diagnostic field or as research tool. The parts of me ki. of .he rnvemion 

ZsTT^' °' -""^'"-^ °' -"icomainer 

un,^. Manufacture of the kit follows preferab^ standard procedures which are known 
.0 the person skilled in the art. The kit or diagnostic composmons may be used for 
methods for detecting expression of a mutant form of CYP3A4 or CYP3A7 gene in 
accordance with any one of the above-described methods of the invention 
emp,oy,ng. for example, immunoassay techniques such as radioimmunoassay o; 
en.yme,mmunoassay er preferably nucleic acid hybridization and/or amplification 
techniques such as those described herein before and in the examples. 

Some genetic changes lead to altered protein conformational states. For example 
some variant CYP3A4 or CYP3A7 proteins may possess a tertiary structure thai 
ren ers them far less capable of facliita.ing drug metaboiization and transcrip on 
initiation, respectively. Restoring the normal or regulated confortnation of mutated 
p™te,ns ,s the most elegant and specific means to correct these molecular defects 
al hough „ is difficult. Pharmacological manipulations thus may aim at restore on ^ 
w,i -type con,om,ation of the protein. Thus, the. poiynucieoMes and encld 
pro e,ns o, the present invention may also be used to design and/or IdentL 
molecules which are capable of ac^Vatlng the wild-type function of a CyP3A4 o 
CYP3A7 gene or protein. 
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In another embodiment the present invention relates to the use of a drug or prodrug 
for the preparation of a pharmaceutical composition for the treatment or prevention of 
a disorder diagnosed by the method described hereinbefore. 

Furthermore, the present invention relates to the use of an effective dose of a nucleic 
acid sequence encoding a functional and expressible wild type CYP3A4 or CYP3A7 
protein for the preparation of a pharmaceutical composition for treating, preventing 
and/or delaying a disorder diagnosed by the method of the invention. A gene 
encoding a functional and expressible CYP3A4 or CYP3A7 protein can be introduced 
into the cells which in turn produce the protein of interest. Gene therapy, which is 
based on introducing therapeutic genes into cells by ex-wVo or in-vivo techniques is 
one of the most important applications of gene transfer. Suitable vectors and 
methods for in-vitro or in-vivo gene therapy are described in the literature and are 
known to the person skilled in the art; see, e.g., Giordano, Nature Medicine 2 (1996), 
534-539; Schaper, Girc. Res. 79 (1996), 911-919; Anderson, Science 256 (1992), 
808-813; Isner, Lancet 348 (1996), 370-374; Muhlhauser, Giro. Res. 77 (1995), 1077- 
1086; Wang, Nature Medicine 2 (1996), 714-716; W094/29469; WO 97/00957 or 
Schaper, Current Opinion in Biotechnology 7 (1996), 635-640, and references cited 
therein. The gene may be designed for direct introduction or for introduction via 
liposomes, or viral vectors (e.g. adenoviral, retroviral) into the cell. Preferably, said 
cell is a germ line cell, embryonic cell, or egg cell or derived therefrom, most 
preferably said cell is a stem cell. 

As is evident from the above, it is preferred that in the use of the invention the nucleic 
acid sequence is operatively linked to regulatory elements allowing for the expression 
and/or targeting of the GYP3A4 or GYP3A7 protein to specific cells. Suitable gene 
delivery systems that can be employed in accordance with the invention may include 
liposomes, receptor-mediated delivery systems, naked DNA, and viral vectors such 
as herpes viruses, retroviruses, adenoviruses, and adeno-associated viruses, among 
others. Delivery of nucleic acids to a specific site in the body for gene therapy may 
also be accomplished using a biolistic delivery system, such as that described by 
Williams (Proc. Natl. Acad. Sci. USA 88 (1991), 2726-2729). Standard methods for 
transfecting cells with recombinant DNA are well known to those skilled in the art of 
molecular biology, see, e.g., WO 94/29469; see also supra. Gene therapy may be 
carried out by directly administering the recombinant DNA molecule or vector of the 
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invention to a patient or by transfecting cells with tiie polynucleotide or vector of the 
invention ex vivo and infusing the transfected cells into the patient. 

In a preferred embodiment of the uses and methods of the invention, said disorder is 
cancer. 

These and other embodiments are disclosed or are obvious from and encompassed 
by the description and examples of the present invention. Further literature 
concerning any one of the methods, uses and compounds to be employed in 
accordance with the present invention may be retrieved from public libraries, using 
for example electronic devices. For example the public database "Medline" may be 
utilized which is available on Internet, e.g. under 
http://www.ncbi.nlm.nih.gov/PubMed/medline.html. Further databases and 
addresses, such as http://www.ncbi.nlm.nih.gov/, http://www.infobiogen.fr/, 
http://www.fmi.ch/biology/research_tools.html, http://www.tigr.org/, are known to the 
person skilled in the art and can also be obtained using, e.g., http://www.lycos.com. 
An overview of patent information in biotechnology and a survey of relevant sources 
of patent information useful for retrospective searching and for current awareness is 
given in Berks, TJBTECH 12 (1994), 352-364. 

The pharmaceutical and diagnostic compositions, uses, methods of the invention can 
be used for the diagnosis and treatment of all kinds of diseases hitherto unknown as 
being related to or dependent on variant CYP3A4 and CYP3A7 genes. The 
compositions, methods and uses of the present invention may be desirably employed 
in humans, although animal treatment is also encompassed by the methods and 
uses described herein. 

Brief description of the figures 

Figure 1: Differences in the genetic makeup influence the efficacy and safety of 
drug treatment. 



Figure 2: 



A current model of the regulation of CYP3A4 by hPXR. 
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Figure 3: (A) Structure of the CYP3A4 gene as described in Ref. by Hashimoto, 
Eur Blochem 218 (1993) 585-95 and confirmed in this study. Coding 
regions are indicated as filled rectangles, the non-coding 5' untranslated 
region as a dashed rectangle. Arrowheads represent the positions of 
oligonucleotides used to screen the coding region of the gene (see 
Table 2 for the oligonucleotide sequences). (B) Determination of the 
exon flanking sequences. Double-headed arrows indicate genomic 
regions amplified by PGR. Single-headed arrows indicate sequences 
obtained by direct sequencing of BAG clones. The sequences of 
oligonucleotides shown in (B) are given in Table 1 . 

Figure 4: Examples of polymorphisms in the GYP3A4 and GYP3A7 genes. 

Numbering of polymorphic sites within the exons of GYP3A4 is based 
on the GenBank sequence M14096. Numbering of polymorphic sites 
within the exons of CYP3A7 is based on the GenBank sequence 
gi4503232. 

An enzymatic test for the CYP3A7 exon 11 G1229G (Thr409Arg) 
polymorphism. (A) The mutation eliminates the unique Alwnl restriction 
site from the exon 11 -containing fragment amplified with 
oligonucleotides 3A442F and 3A438R (arrowheads). (B) Genotyping of 
a wild-type (wt/wt) and a heterozygous (wt/C1229G) DNA sample by 
means of the Alwnl restriction digest. 



Figure 5: 



Figures: Genomic sequences and polymorphisms in CYP3A4 and CYP3A7 
genes. Primers used for the amplification and sequencing (Table 1), as 
well as splice sites are underlined. Thick underlined are polymorphic 
sites and they are shown as the wild-type and variant base, separated 
by an arrow. 

Figure 7: Immunoblot of wild-type CYP3A4 (3A4), R1 30Q (M2), T363M (M5), and 
P416L (M7). P450 (Ipmol) was loaded into each lane, transferred to 
nitrocellulose, and probed with anti-3A12 IgG as described in Materials 
and Methods. 



The invention will now be described by reference to the following examples which are 
merely illustrative and are not to be construed as a limitation of the scope of the 
present invention. 

Examples 

Example 1: Genomic orgar^lzation and oligonucleotides for the amplification of 
the coding regions of CYP3A4 and CYP3A7 

The genomic structure of CYP3A4 has been described In an earlier work (Hashimoto 
Eur. J. Biochem. 218 (1993), 585-95); however, the published exon flanking 
sequences were too short to design oligonucleotides for exon amplification. In 
accordance with the present invention the sequences surrounding the CYP3A4 
^ exons 5, 7, 9, 12 and 13 have been elucidated by sequencing of PGR (Polymerase 
Chain Reaction)-amplified fragments containing parts of two neighboring exons and 
the intercalated intron. The sequences surrounding the CYP3A4 exons 6, 8, 10 and 
1 1 have been elucidated by sequencing of PCR-amplified fragments containing parts 
of two neighboring exons and the intercalated intron as well as by sequencing the 
3A4-containing bacterial artificial chromosome (BAG) (GenBank Accession Number 
AF280107). The sequences surrounding the CYP3A4 exons 1, 3 and 4 were derived 
from a GYP3A4-containing bacterial artificial chromosome (BAG) (GenBank 
Accession Number AF280107). Oligonucleotides used for the amplification of these 
fragments and for sequencing of the BAG were derived from a CYP3A4 cDNA 
sequence (GenBank accession number M14096) upon consideration of the 
exon/intron organization of the gene (Hashimoto, Eur J Biochem 218 (1993). 585- 
95). The sequences thus obtained were used to design oligonucleotides for the 
amplification of exons 1 and 3 -13. Oligonucleotides for the amplification of exons 2 
were designed using the recently determined sequence of a CYP3A4-containing 
bacterial artificial chromosome (GenBank Accession Number AF280107) 

The sequences thus obtained were used to design oligonucleotides for amplification 
of the gene exons. Their composition and the sizes of the resulting DNA fragments 
are given in Table 2. Besides the exon sequences, fragments amplified contain also 
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some flanking intronic sequences, including the splice sites, as well as some 5'- and 
3'-UTR (untranslated region) sequences of the gene. 

Example 2: Isolation of genomic DNA, amplification, purification and 
sequencing of CYP3A4 and CYP3A7 gene fragments 

Genomic DNA was isolated from Caucasian blood or liver samples using Qiagen 
blood and tissue DNA Isolation kits. Samples were collected by the Institute of 
Clinical Pharmacology, University Medical Center Charite, Humboldt University In 
Berlin, Germany, by the Dr. Margarete Fischer-Bosch-Institute of Clinical 
Pharmacology in Stuttgart, Germany, by Department of Pharmacology. Biozentrum 
University of Basel, Switzerland, and by Parexel International, Berlin. Germany' 
under consideration of ail necessary ethical and legal requirements. Conditions for 
the amplification of CYP3A4 and CYP3A7 gene fragments by PCR are given in Table 
2, respectively. The complete sequences of the amplicons are given in Figure 6. The 
quality of amplicons was routinely checked by agarose gel electrophoresis The 
fragments were then processed through PCR purification columns (Qiagen) which 
remove all the components of the PCR that could othenA/ise interfere with the 
subsequent sequencing reaction. 

The sequencing reaction was performed using the dye-terminator method and the 
samples were then resolved on polyacrylamide gels (Perkin-Elmer 377 and 3700 
sequencing machines). Both strands were routinely sequenced to assure high 
accuracy of the results and the detection of heterozygotes. The sequences were 
visually inspected for their quality and then analyzed for the presence of 
polymorphisms using the PHRED/PHRAP/POLYPHRED/CONSED software package 
(University of Washington. Seattle. USA). 
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Example 3: Polymorphisms in the CYP3A4 and CYP3A7 gene 



In accordance with the present Invention, a panel of Caucasian DNA samples were 
screened for nnutatlons in exons 1 to 13 of the CYP3A4 gene, and in exon 11 of the 
CYP3A7 gene. The results for CYP3A4 were obtained for between 296 and 426 
chromosomes per PGR fragment (Table 3). A comparison of the sequences 
generated with the three CYP3A4 cDNAs originally published (Beaune, Proc Natl 
Acad Sci U S A 83 (1986), 8064-8; Gonzalez, Dna 7 (1988), 79-86; Bork, J Biol 
Chem 264 (1989), 910-9) indicates that only the cDNA with a GenBank Accession 
Number Ml 8907 (Gonzalez, Dna 7 (1988), 79-86) encodes for a CYP3A4 protein 
such as expressed by most Caucasians. An overview of variants found is given in 
Table 3. In the CYP3A4 gene, we have identified altogether 18 variant positions, all 
of which are single nucleotide polymorphisms (SNPs). Ten variants are located within 
the protein-coding regions of CYP3A4, one within the untranslated region (3' UTR) of 
exon 13. and seven within the exon-flanking intronic sequences. No variants have 
been found within the splice sites of the gene's exons. Fifteen out of 18 variants had 
allelic frequencies below, and three, including one missense mutation (Ml), above, 
1% (Table 3). The highest allelic frequency (9.5%) exhibits the M14 variant located in 
intron 10 (Table 3). 

Out of 10 variants located in the protein-coding regions of the gene two are silent, 
whereas eight result in amino acid exchanges of the CYP3A4 protein. Out of the 
eight missense variants, seven are new, whereas the M8 variant (M445T) is identical 
to the CYP3A4*3 allele recently described in a Chinese subject (Sata. Clin 
Pharmacol Ther 67 (2000), 48-56). The most frequent missense mutation (G56D) 
was found in 2.82% subjects tested (Table 3). Taken together, 7.5% of Caucasians 
tested in the screen were heterozygous for a variant CYP3A4 protein. 

For CYP3A7 a C1229G SNP in exon 11 was found in 17 out of 232 chromosomes 
screened. The SNP results in a non-conservative amino acid exchange (Thr^Arg) 
at position 409 of the CYP3A7 protein. 
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Example 4: An enzymatic test for the CYP3A7 exon 11 C1229G polymorphism 

The C1229G polymorphism detected in exon 11 of CYP3A7 results in the loss of an 
Alwnl restriction site (Fig. 5A). In accordance with the present invention an Alwnl- 
based test was developed for the genotyping of the C1229G allele. An example is 
shown in Fig. 5B. A digest of the 404 bp genomic fragment amplified from a wild-type 
sample (wt/wt) with primers 3A742F and 3A738R (Table 2) generates two fragments 
of 316 bp and 88 bp, respectively. In a heterozygous sample (wt/C1229G), 
approximately half of the DNA remains undigested, due to the loss of the restrictiori 
site in the mutant allele. 



Because population genetics enables a calculation of the expected frequency of 
homozygous vs. heterozygous alleles of a defined gene (Hardy Weinberg formula, p 
e2 + 2pq + q e2 = 1), it is also possible to confirm the predicted (with that formula) 
distribution of homozygous vs. heterozygous alleles and deviations with the 
experimental findings. This can serve as internal control and confirmation that a 
detected sequence deviation indeed represents a novel allele. 

Example 5: CYP3A4 Mutant Expression, Purification and Western Blotting 

Mutants were expressed in E. coli TOPP3 cells. Cultures of each strain were started 
from single colonies in 2-ml of LB media containing ampicillin (50 ^g/ml) and 
tetracycline (15 ^g/ml) and grown overnight at 37° C with shaking. Larger, 20-ml 
cultures of each sample were inoculated and grown as described for the 2-ml 
cultures. Starter cultures (15 ml) were inoculated into 250-ml of TB media containing 
ampicillin (50 ^g/ml) and grown 2-3 hr at 37° C at 250 rpm. 5-amino!evulinic acid (80 
mg/L) and IPTG (ImM) were added and the cultures grown 72 hr at 30° C at 190 
rpm. The cells were harvested, sonicated and CHAPS solubilized, and the protein 
was purified on Talon Metal Affinity resin from Clontech (Palo Alto, CA). Final P450 
content was measured by reduced CO difference spectra (Omura, J Biol Chem 239 
(1964), 2370-2378), and each protein was visualized on an 8.5% SDS PAGE gel 
stained with Coomassie blue. Western Blot analysis of CYP3A4 wild-type, M2, M5 
and M7 was performed by resolving 1 pmol of 3A4 wildtype and M5. and 8 nl of M2 
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and M7 total protein sample on an 8.5% SDS PAGE gel. The samples were then 
transferred to nitrocellulose and probed with an anti-3A12 polyclonal antibody known 
to cross-react with CYP3A4 (Ciaccio, Arch Blochem Biophys 271 (1989), 284-99). 

Example 6: Functional characterization of CYP3A4 protein variants 

The effect of the protein variants on the expression level and catalytical activity of 
CYP3A4 was investigated using a bacterial CYP3A4 expression system. Sequence 
variants were introduced into the pSE3A4H expression vector (Harlow, J Biol Chem 
272 (1997), 5396-402) which encodes for a CYP3A4 protein identical to that found in 
most Caucasians. No other, unwanted mutations were introduced into the CYP3A4 

J insert or the promoter driving its expression as confirmed by sequencing. Mutants 
M1-M8 were expressed in E. coli as described in Example 5, the expression of the 
CYP3A4 holoenzyme assessed by measuring reduced CO difference spectra, and 
the protein purified. All but M2, M5 and M7 mutants were detectable at levels similar 
to CYP3A4. M5 expressed at levels less than 10% of CYP3A4 and demonstrated 
some instability, as indicated by inconsistency of P450 measurements in multiple 
determinations. M2 and M7 resulted in no detectable P450 holoprotein in four 
attempts. The expression of M2, M5, and M7 was further investigated by Western 
Blotting using an anti-CYP3A12 polyclonal antibody known to cross-react with 
CYP3A4 (Ciaccio, Arch Blochem Biophys 271 (1989), 284-99) as described in 
Example 5. Bands of the expected size (about 54 kDA) were visualized in the lanes 

] containing CYP3A4 wild-type and M2 mutant proteins (Fig. 7). Only a very faint band 
was seen in the lane containing M7, and this band was not significantly darker than 
background. These results indicate that M2 and M7 mutations abolish, whereas M5 
diminishes the expression of CYP3A4 holoenzyme. 

Following the reconstltution with NADPH-cytochrome P450 reductase and 
cytochrome bs, the catalytical activities of M1, M4, M5 and M6 variants were 
measured using testosterone, progesterone and 7-BFC as substrates (Tables 5-7). 
Mutants Ml and M4 displayed steroid hydroxylase activities and 7-BFC debenzylase 
activity that differed from CYP3A4 (Tables 5-7). M4 displayed <50% of wild-type 
CYP3A4 steroid hydroxylase activity at the lower steroid concentration (Tables 5 and 
6), and <50% of CYP3A4 7-BFC debenzylase activity (Table 7). M1 showed 137% of 
CYP3A4 7-BFC debenzylase activity (Table 7) but only 53% of the CYP3A4 
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testosterone hydroxylase activity at a testosterone concentration of 25 jjM (Table 5). 
Neither M1 nor M4 mutant displayed alterations in their steroid hydroxylase 
metabolite profiles when compared with wild-type CYP3A4. The activity of the M5 
mutant was approximately half that of wild-type protein (Table 5-7), although this 
finding could in part reflect M5's apparent reduced stability. On the other hand, 
mutant M6 displayed a strikingly altered testosterone hydroxylase metabolite profile 
(Table 5). 

Table 1. Oligonucleotides used to determine the exon/intron boundaries of the 
CYP3A4 gene and of the exon 1 1 of CYP3A7 (in bold). 



Name 


Position 


Sequence (5' - 3') 


T7 


BAG CS 


TAATACGACTCACTATAGGG 


3A43F 


exon 2 


GAACCCATTCACATGGAC 


3A46R 


exon 4 


TGATCATGTCAGGATCTG 


3A47F 


exon 4 


GGTCAACAGCCTGTGCTG 


3A48R 


exon 5 


TCCACTGGTGAAGGTTGG 


3A49F 


exon 5 


GTGCCATGTCTATAGCTG 


3A410R 


exon 6 


CTTCGCGCCTCAGATTTC 


3A41 1 F 


exon 6 


GAAATGTGAGGGGGGAAG 


3A412R 


exon 7 


GGGTCTTGTGGATTGTTG 


3A413F 


exon 7 


GAAGAATGGAGAAGAGGG 


3A414R 


exon 8 


GTGTATCTTGGAGGGGAG 


3A415F 


exon 8 


GnTGGATTGGTCATCGG 


3A416R 


exon 9 


GCTTTGTGGGAGTGAGnTG 


3A419F 


exon 10 


GGGACTGAGGCTGATGTG 


3A720R 


exon 11 


ATCACCACCCACCCTTTG 


3A721F 


exon 11 


CAAAGGGTGGGTGGTGAT 


3A422R 


exon 12 


GAGAGGAAAGGTCATGGC 


3A423F 


exon 12 


GGCATGAGGTTTGGTGTC 


3A424R 


exon 13 


GGTGGGATGGGTTGAGTC 


3A426R 


exon 2 


GGAGAGGTGTGGGGCGTG 


3A4436F 


intron 8 


GGAGATGAAGGAGCACGCTTGTG 


3A441R 


intron 10 


CTTAGGCTTCTGCGAGTAGCAACG 


CYP3A4PF 


aromoter 


AACAGGCGTGGAAACACAAT 


CYP3A4PR 


promoter 


CTTTGGTGCCCTGCACAG 
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Fifty ng of genomic DNA was added to a reaction mix (total volume 30 or 50 
containing 1x PGR buffer {Q=Qjagen, Cat.Nr. 1005479, or B2=Boehringer (currently 
Roche) Expand Long Template PGR Buffer number 2, Gat, Nr. 1742655), 0.25 /jM 
each oligonucleotide (Metabion), 200 //M dNTPs, and 1 U of Taq polymerase 
(Qiagen). Amplifications were performed on a RoboCycler Gradient 96 (Stratagene) 
with an initial denaturation step of 2 min. at 94 °G followed by 32 amplification cycles 
of denaturation (40 sec, 94 ^C), annealing (45 sec, temperatures 56-60 °G), and 
extension (60-150 sec, 72 °G). This was followed by a final extension step 5 min., 72 
°C The sequencing of PGR fragments and BAG clones was performed on a 
GeneAmp PGR System 9700 (Perkin-Elhier) using a dye-terminator DNA sequencing 
kit (Perkin-Elmer, Gat.Nr. 4303154), according to manufacturers's instructions. 
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Table 2: CYP3A4 (exons 1 to 13) and CYP3A7 (exon 11. bold) polymorphism 



Exon 


Pimer name 


Primer sequence (5 - 3' orientation) 


Location on 
BAG 22300 


Fragment 
length 


Annealing 
temperature 


1 


3A4-62F 


AACTGCAGGCAGAGCACAGGT 


61838-61858 


384 bp 


63**C 


3A4-64R 


CCACGCGCGGCCTGAACATCT 


62221 - 62201 




2" 


3A4-101F 


TAGGATCCAATCATCTCCTAG 


65072 - 65092 




62*'C 


3A4-68F 


GGTGTGTCATGGTGGAGG 


65841 - 65858 


463 bp 


3A4-103R 


AGAGTTAGCAAGAGAGCCCTT 


66303 - 66283 


3 


3A4-50F 


CCTCTAACTGCCAGCAAGTCTG 


67924 - 67945 


249 bp 


58°C 


3A4-51R 


GCGCTGAGACTGTCGTCTGTG 


68172-68152 






3A4-52F 


AGTCTGGCTTCCTGGGTTGGGCTC 


73343 - 73366 


293 bp 


58X 
63°C 


3A4-37R 


GAAGTGGACGTGGAACCTTCCTGGAC 


73635-73610 




3A4-100R 


GGGGACAGGATGAAGTGGACG 


73646 - 73626 


304 bp 


5 


3A4-28F 


TACAACCATGGAGACCTCG 


75813-75832 


236 bp 


62^C 


3A4-29R 


TACCTGTCCCCACCAGATTC 


76048 - 76029 




6 


3A4.57F 


CCCTTTGCAAGGGGTAGTCC 


76066 - 76085 


379 bp 


58°C 


3A4-32R 


GTCTGGTCACTGGAATAACCCAACAGCA 


76444 - 76415 


7 


3A4-33F 


GTCTGTCTTGACTGGACATGTGG 


77509 - 77531 


393 bp 


5S°C 


3A4-34R 


GATGATGGTCACACATATCTTC 


77901 - 77880 


8 


3A4-35F 


GGCTTCCAGTTGAGAACCTTGATGTC 


78723 - 78748 


389 bp 


58°C 


3A4-59R 


GGTCTAAACATGAGCAGTCTTC 


791 1 1 - 79090 


9 


3A4-36F 


GGAGATCAAGGACCAGGCTTGTG 


79584 - 79606 


240 bp 


62°C , 


3A4-47R 


CTCATCATCCTGGAATACTTCCTGC 


79823 - 79799 


10 


3A4-82F 


CCGAGTGTACCTCTGAATTGC 


81959-81979 


431 bp . 


50°C 




3A4-95R 


CAGAGCCTTCCTACATAG 


82389 - 82372 


11 


3A4-97F 


CAGTATGAGTTAGTCTCTGG 


83733 - 83752 


574 bp 


50°C 




3A4-80R 


CATAACTGATGACCTTCATCG " " 


84306 - 84286 


12 


3A4-49F 


CCTGTGTACTGCTAGTAGAGGG 


85020 - 85041 


411 bp 






3A4-39R 


CACAGATGGGCCTAATTG 


35430-85413 


13 


3A4-48F 


GGAGTGTCTCACTCACTTTGATGC i 


37799 - 87822 


288 bp i 


50°C 




3A4-25R 


rGGATGAAGCCCATCTTC I 


38086 - 88069 


11 : 


3A7-42F 1 


:CAGTATGAGTTGTTCTCTGG 1 


>7799 - 87822 


m bp £ 


58°C 




JA7-38R J 


\GGCAGAATATGCTTGAACCAGGC £ 


18086 - 88069 



a 42 

Primer 3A4-101F and 3A4-103R are used for amplification of a 1231 bp 
^ fragment; primer 3A4-68F and 3A4-1 03R are used for sequencing 

Primers 3A4-52F and 3A4-37R can be used in combination for amplification 
resulting in a 293 bp fragment at an annealing temperature of 58°C and primers 
3A4-52F and 3A4-100R can be used in combination for amplification resulting in 
a 304 bp fragment at an annealing temperature of 63°C 

Fifty ng of genomic DNA was added to a reaction mix containing 1x PGR buffer 
(Qiagen), 0.5 ;;IVI oligonucleotides, 200 pM dNTPs, and 1 U of Taq polymerase 
(Qiagen). Amplifications were performed on a RoboCycler Gradient 96 (Stratagene) 
with an initial denaturation step of 2 min. at 94 °C followed by 32 amplification cycles 
of denaturation (40 sec, 94 °C), annealing (45 sec, temperatures given above), and 
extension (60 sec, 72 °C). This was followed by a final extension step 5 min., 72 °C 
All sequencing reactions were performed on a GeneAmp PGR System 9700 (Perkin- 
Elmer) using a dye-terminator DNA sequencing kit (Perkin-Elmer, " catalog 
number4303154), according to manufacturer's instructions. 
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Table 4: Genetic variants of CYP3A4 and CYP3A7. 



Gene Position 
CYP3A4 axon 3 



intron 7 



wt sequence (5'-3') 

F : TCCCAGGGCTTTTGT 
R: ACAAAAGCCCTGGGA 

F : TATCTTTCTCTCTT 
R: AAGAGAGAAAGATA 



mut sequence (5'-3') 

F : TCCCAGGACTTTTGT 
R: ACAAAAGTCCTGGGA 

F : TATCTTGCTCTCTT 
R : AAGAG AGO AAGATA 



exon 5 f : attacgatcat 

R : ATGATCGTAAT 



F : ATTACAATCAT 
R: ATGATTGTAAT 



exon 6 f : agcctgtcacc 

R: GGTGACAGGCT 



F : AGCCTATCACC 
R : GGTGATAGGCT 



exon 6 f : tgaaagagtaa 

R : TTACTCTTTCA 



F : tgaaagagtaa 
R : TTACTGTTTCA 



intron 6 f : gcagccatggg 

R: CCCATGGCTGC 



F : GCAGCTATGGG 
R : CCCATAGCTGC 



intron 6 f : atggggttctg 

R: CAGAACCCCAT 



F : ATGGGTTTCTG 
R : CAGAAACCCAT 



intron 6 f : ccagctgcctg 

R: CAGGCAGCTGG 



F:. CCAGCGGCCTG. 
R: CAGGCCGCTGG 



intron 10 



F: GGATGGTACAT 
R: ATGTACCATCC 



F : GGATGATACAT 
R : ATGTATCATCC 



exon 11 F : TGAAACGCTCA 

, R: TGAGCGTTTCA 



F : TGAAATGCTCA 
R: TGAGCATTTCA 



exon 11 F : GAAACGCTCAG 

R: CTGAGCGTTTC 



F : GAAACACTCAG 
R : CTGAGTGTTTC 



exon 1 1 



F: TGAGACTTGAG 



F : TGAGATTTGAG 
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R : CTCAAGTCTCA 



R : CTCAAATCTC A 



exon 11 F : CCTCCCTGAAA 

R : TTTCAGGGAGG 

intron 11 f : caaggcccctg 

R: CAGGGGCCTTG 

intron 11 f : accaacgtgga 

R: TCCACGTTGGT 

exon 12 F: tggcatgaggt 

R : AC CTC ATGCC A 

exon 13 f : ggcaccgtaag 

R: CTTACGGTGCC 

3'UTR F: ACTTCTGCTTT 

R: AAAGCAGAAGT 

CYP3A7 exon 11 f : tactggacagagc 

R : GCTCTGTCCAGTA 



F : CCTCCTTGAAA 

R : TTTCAAGGAGG 

F : CAAGGTCCCTG 

R : CAGGGACCTTG 

F : ACCAATGTGGA 

R : TCCACATTGGT 

F : TGGCACGAGGT 

R: ACCTCGTGCCA- 

F: GGCACTGTAAG 

R: CTTACAGTGCC 

F : ACTTCGGCTTT 

R: AAAGCCGAAGT 

F : TACTGGAGAGAGC 

R : GCTCTCTCCAGTA 
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Table 5: Testosterone hydroxylase activity of wild-type and mutant CYP3A4 
proteins 



protein 



WT 



M1 



M4 



M5 



M6 



25ijM 
Testosterone 
(nmol product/min/nmol 
protein) 



6j3-OH 



9.3' 



5.1, 4.8"= 



4.4, 3.9 



5.1' 



10.1,8.6 



15/3-OH 



0.4 



0.2, 0.2 



0.1, 0.1 



0.1 



1.7, 1.3 



2/3-OH 



0.2 



<0.1,<0.1 



<0.1,<0.1 



<0.1,<0.1 



3.5, 2.8 



400 pM 
Testosterone 
(nmol product/min/nmol protein) 



6jB-OH 



41.6' 



32.1, 32.2' 



32.4, 29.8 



19.5' 



45.3, 58.8 



150-OH 



1.2 



1.2, 1.1 



1.3, 0.9 



0.7 



5.9, 7.7 



2P-OH 



3.0 



5.4, 6.9 



5.3, 5.6 



3.6 



29.2, 
36.3 



Single values are the average of duplicates from assays performed with one 
preparation only. 

The two values represent separate protein preparations. Each value is the averaae 
of assays performed in duplicate. - ^ 

° Only one preparation of M5 was purified. 



Table 6: Progesterone hydroxylase activity of wild-type and mutant CYP3A4 
proteins 





25 //M 
Progesterone 
(nmol product/min/nmol protein) 


150/yM 
Progesterone 
(nmol product/min/nmol orotGln^ 


Protei 
n 


63-OH 


16a-OH 


6p/16a 


63-OH 


16a-OH 


6|3/16a 


WT 


12.1^ 


1.7 


7.1 


32.1^ 


6.4 


5.0 


Ml 


7.8, 7.9'' 


1.4, .1.4 


5.6, 5.6 


.21.3, 21.3" 


4.2, 4.2 


5.1, 5.1 


M4 


5.5, 5.4 


0.5, 0.5 


11.0, 
10.8 


21.4, 18.4 


3.9, 3.1 


5.5, 5.9 


M5 


. 5.9<= 


0.6 


9.8 


^ 16.6'= 


3.1 


5.4 


M6 


15.8,12.3 


2.1. 1.6 


7.5, 7.7 


40.1, 49.8 


5.1, 6.5 


7.9, 7.7 
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Table 7: 7-Benzyloxy-4-(trifluoromethyl)coumarln (7-BFC) debenzylase activity 
of wild-type and mutant CYP3A4 proteins 



Protein 


7-hydroxy-4-(trifluoromethyl)coumarin 
(nmol product/min/nmol protein) 


WT 


6.5^ 


M1 


8.8, 9.1" 


M4 


2.3, 1.9" 


M5 


2.9° 


M6 


4.5, 5.2 



Single values are the average of duplicates from assays performed with one 
preparation only. 

" The two values represent separate protein preparations. Each value is the a 

of assays performed In duplicate. 
° Only one preparation of M5 was purified. 
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SEQUENCE LISTING 

<110> Epidauros Biotechnologie AG 

<120> Polymorphisms in the human CYP3A4 and CYP3A7 genes and 
their use in diagnostic and therapeutic applications 

<130> D 2145 PCT 

<140> 
<141> 

<150> EP 99 11 8120.7 
<151> 1999-09-10 

<160> 172 

<170> Patentin Ver . 2.1 

<210> 1 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 1 

taatacgact cactataggg 20 

<210> 2 
<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 2 

gaacccattc acatggac 18 

<210> 3 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 3 

tgatcatgtc aggatctg 18 



<210> 4 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 



Express Mail Label 
No, EV 045436952 US 



<400> 4 

ggtcaacagc ctgtgctg 



<210> 5- 
<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 5 

tccactggtg aaggttgg 

<210> 6 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 6 

gtgccatctc tatagctg 



<210> 7 . 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 
<400> 7 

cttcccgcct cagatttc 



<210> 8 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

.<223> Description of Artificial Sequence: artificial 
<400> 8 

gaaatctgag gcgggaag 

<210> 9 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 9 

gggtcttgtg gattgttg 
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<210> 10 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 10 

caacaatcca caagaccc 

<210> 11 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 11 

gtgtatcttc gaggcgac 

<210> 12 

<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 12 

ctttccattc ctcatccc 



<210> 13 

<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 13 

cctttgtggg actcagtttc 

<210> 14 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 14 

gccactcacc ctgatgtc 



<210> 15 
<211> 18 



52 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 15 

atcaccaccc accctttg 

<210> 16 

<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 16 

caaagggtgg gtggtgat 

<210> 17 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 17 

gagagcaaac ctcatgcc 

<210> 18 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 18 

ggcatgaggt ttgctctc 



<210> 19 
<21i> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial . Sequence : artificial 
<400> 19 

ggtgccatcc cttgactc 

<210> 20 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: artificial 

<400> 20 

gcagaggtgt gggccctg 

<210> 21 

<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 21 

ggagatcaag gaccacgctt gtg 

<210> 22 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 
<400> 22 

cttacgcttc tgccagtagc aacc 

<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 23 

aacaggcgtg gaaacacaat 

<210> 24 
<211> 18 

<212> DNA 

<213> Artificial Sequence . 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 24 

ctttcctgcc ctgcacag 

<210> 25 
<211> 21 

<212> DNA ■ • ' • 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 



<400> 25 

aactgcaggc agagcacagg t 
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<210> 26 
<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 26 

ccacgcccgg cctgaacatc t 

<210> 27 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 27 

taggatccaa tcatctccta c 

<210> 28 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 28 

ggtgtctcat ggtggagg 

<210> 29 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 
<400> 29 

agagttagca agagagccct t 

<210> 30 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 30 

cctctaactg ccagcaagtc tg 
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<210> 31 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 
<400> 31 

gcgctgagac tgtcctctgt g 

<210> 32 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 
<400> 32 

agtctggctt cctgggttgg gctc 

<210> 33 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 33 

ggggacagga tgaagtggac g 

<210> 34 

<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 34 

tacaaccatg gagacctcc 

<210> 35 

<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 



<400> 35 

tacctgtccc caccagattc 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 36 

ccctttccaa ggggtagtcc 

<210> 37 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 37 

gtctggtcac tggaataacc caacagcagg 

<210> 38 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 38 

gtctgtcttg actggacatg tgg 



<210> 39 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 39 

gatgatggtc acacatatct tc 



<210> 40 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 40 

ggcttccagt tgagaacctt gatgtc 

<210> 41 - 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: artificial 
<400> 41 

gctctaaaca tgagcagtct tc 

<210> 42 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 42 ' 
ggagatcaag gaccacgctt gtg 

<210> 43 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 43 

ctcatcatcc tggaatactt cctgc 

<210> 44 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 44 

cccagtgtac ctctgaattg c 

<210> 45 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 45 

cagagccttc ctacatag 

<210> 46 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
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<400> 46 

cagtatgagt tagtctctgg 



<210> 47 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 47 

cataactgat gaccttcatc g 



<210> 48 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 
<400> 48 

cctgtgtact gctagtagag gg 

<210> 49 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 49 

cacagatggg cctaattg 

<210> 50 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 50 

ggagtgtctc actcactttg atgc 

<210> 51 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 



<400> 51 
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tggatgaagc ccatcttc 



<210> 52 
<211> 15 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 52 

tcccagggct tttgt 

<210> 53 
<211> 15 
<212> DNA 

<213> Artificial Sequence 

<220> . ^ 

<223> Description of Artificial Sequence: artificial 

<400> 53 

acaaaagccc tggga 

<210> 54 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 54 

tcccaggact tttgt 



<210> 55 

<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 55 

acaaaagtcc tggga 

<210> 56 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> ' • 

<223> Description of Artificial Sequence: artificial 

<400> 56 
tatctttctc tctt 
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<210> 57 
<211> 14 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 57 
aagagagaaa gata 

<210> 58 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 58 
tatcttgctc tctt 



<210> 59 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 59 
aagagagcaa gata 

<210> 60 
<211> 11 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 60 
attacgatca t 

<210> 61 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 61 
atgatcgtaa t 



<210> 62 
<211> 11 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: artificial 

<400> 62 
attacaatca t 



<210> 63 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 63 
atgattgtaa t 



<210> 64 
<211> 11 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 64 
agcctgtcac c 

<210> 65 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 65 
ggtgacaggc t 

<210> 66 
<211> li 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 66 
agcctatcac c 



<210> 67 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: artificial 

<400> 67 
ggtgataggc t 



<210> 68 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 68 
tgaaagagta a 



<210> 69 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 69 
ttactctttc a 



<210> 70 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 70 
tgaaacagta a 

<210> 71 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 71 
ttactgtttc a 



<210> 72 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 



<400> 72 



gcagccatgg g 



<210> 73 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 73 
cccatggctg c 

<210> 74 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 74 
gcagctatgg g 

<210> 75 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 75 
cccatagctg c 

<210> 76 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 76 
atggggttct g 



<210> 77 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 77 
cagaacccca t 
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<210> 78 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 78 
atgggtttct g 

<210> 79 

<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> - 

<223> Description of Artificial Sequence: artificial 

<400> 79 
cagaaaccca t 



<210> 80 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 80 
ccagctgcct g 

<210> 81 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 81 
caggcagctg g 

<210> 82 ^ 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220>* 

<223> Description of Artificial Sequence: artificial 

<400> 82 
ccagcggcct g 



<210> 83 
<211> 11 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: artificial 



<400> 83 
caggccgctg g 



11 



<210> 84 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 84 

ggatggtaca t ' 11 

<210> 85 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 



<210> 86 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 



<210> 87 . ■ . 

<211> 11 
<212> DNA 

<213> Artificial Sequence- 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 87 

atgtatcatc c . . ■ 11 



<210> 88 
<211> 11 
<212> DNA 

<213> Artificial Sequence 



<400> 85 
atgtaccatc c 



11 



<400> 86 
ggatgataca t 



11 



<220> 
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<223> Description of Artificial Sequence: artificial 



<400> 88 
tgaaacgctc a 

<210> 89 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 89 
tgagcgtttc a 



<210> 90 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 90 
tgaaatgctc a 

<210> 91 • 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 91 
tgagcatttc a 



<210> 92 
<211> 11 
<212> DNA 

<213.> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 92 
gaaacgctca g 

<210> 93 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 



<400> 93 
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ctgagcgttt c 



<210> 94 

<211> 11 

<212> DNA 

<213> Artificial Sequence 
<220> 

•<223> Description of Artificial Sequence: artificial 

<400> 94 
gaaacactca g 

<210> 95 

<211> 11 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 95 
ctgagtgttt c 



<210> 96 

<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 96 
tgagacttga g 



<210> 97 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 97 
ctcaagtctc a 

<210> 98 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 



<400> 98 
tgagatttga g 
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<210> 99 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 99 
ctcaaatctc a 



<210> 100 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> .100 
cctccctgaa a 

<210> 101 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Ar.tificial Sequence: artificial 

<400> 101 
tttcagggag g 

<210> 102 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 102 
cctccttgaa a 

<210> 103 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 103 
tttcaaggag g 



<210> 104 
<211> 11 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 104 
caaggcccct g 



<210> 105 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 105 
caggggcctt g 

<210> 106 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 106 
caaggtccct g 

<210> 107 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial. 

<400> 107 
cagggacctt g 



<210> 108 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 108 
accaacgtgg a 

<210> 109 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
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<223> Description of Artificial Sequence: artificial 



<400> 109 
tccacgttgg t 



<210> 110 
<211> 11 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 110 
accaatgtgg a 

<210> 111 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 111 
tccacattgg t 



<210> 112 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 112 
tggcatgagg t 

<210> 113 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 113 
acctcatgcc a 

<210> 114 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 



<400> 114 



tggcacgagg t 



<210> 115 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 

<400> 115 
acctcgtgcc a 



<210> 116 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 

<400> 116 
ggcaccgtaa g 



<210> 117 

<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 

<400> 117 
cttacggtgc c 



<210> 118 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 

<400> 118 
ggcactgtaa g 



<210> 119 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
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11 

Sequence: artificial 

11 

Sequence: artificial 

11 

Sequence: artificial 

11 

Sequence : artificial 

11 

Sequence: artificial 



<400> 119 
cttacagtgc c 



11 
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<210> 120 

<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 120 
acttctgctt t 



<210> 121 
<211> 11 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial 

<400> 121 
aaagcagaag t 

<210> 122 
<211> 11 • 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 122 
acttcggctt t 



<210> 123 
<211> 11 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 123 
aaagccgaag t 

<210> 124 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 124 
tactggacag age 



<210> 125 
<211> 13 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: artificial 
<400> 125 

gctctgtcca gta 13 



<210> 126 
<211> 13 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 

<400> 126 

tactggagag age 13 



<:210> 127 
<211> 13 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 127 

gctctctcca gta 13 



<210> 128 

<211> 249 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> intron 
<222> (1) . . (115) 

<220> 

<221> exon 

<222> (116) . . (168) 

<220> 

<221> intron 
<222> (169) . . (249) 

<220> 

<221> CDS 

<222> (116) . . (166) 

<400> 128 

cctctaactg ccagcaagtc tgatttcatt ggcttcgact gttttcatec caattagagg 60 

cagggttaag tacattaaaa ataataatca aatattattt tgtttctcct cccag grc 118 

Xaa 
1 

^ttt tgt atg ttt gac atg gaa tgt cat aaa aag tat gga aaa gtg tgg 166 
Phe Cys Met Phe Asp Met Glu Cys His Lys Lys Tyr Gly Lys Val Trp 



74 

5 10 • 15 

gggtgagtat tctggaaact tccattggat agacttgttt ctatgatgag tttaccccac 226 
tgcacagagg acagtctcag ccc 249 



<210> 129 

<211> 17 

<212> PRT 

<213> Homo sapiens 



<400> 129 

Xaa Phe Cys Met Phe Asp Met Glu Cys His Lys Lys Tyr -Gly Lys Val 
15 10 15 



Trp 



<210> 130 

<211> 293 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> intron 
<222> (1) . . (77) 

<220> 

<221> exon 

<222> (78) . . (177) 

<220> 

<22 1> intron 
<222> (178) . . (293) 

<220> 
<221> CDS 
.<222> (79) . . (177) 

<400> 130 

agtctggctt cctgggttgg gctccagctg tagaataagg ctgttgatgt ttaatcaact 60 

ctgttttttt cacacagc ttt tat gat ggt caa cag cct gtg ctg get ate 111 

Phe Tyr Asp Gly Gin Gin Pro Val Leu Ala lie 

1 5-10 

aca gat cct gac atg ate aaa aca gtg eta gtg aaa gaa tgt tat tct 159 
Thr Asp Pro Asp Met lie Lys Thr Val Leu Val Lys Glu Cys Tyr Ser 
15 20 25 

gtc ttc aca aac egg agg gtaagcattc atgtgttgaa attaaaatac 207 
Val Phe Thr Asn Arg Arg 
30 

tgattgatta aatttatatt ttgaaattct tatatattca tagacagttg cctaaaaaat 267 
gtccaggaag gttccacgtc cactte 293 



<210> 131' 
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<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 131 

Phe Tyr Asp Gly Gin Gin Pro Val Leu Ala He Thr Asp Pro Asp Met 
15 10 15 

He Lys Thr Val Leu Val Lys Glu Cys Tyr Ser Val Phe Thr Asn Arg 
20 25 30 

Arg 



<210> 132 
<211> 236 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> intron 
<222> (1) . . (61) 

<220> 

<221> exon 

<222> (62) . . (175) 

<220> 

<221> intron 
<222> (176) . . (236) 

<220> 

<221> CDS 

<222> (62) . . -(175) 

<400> 132 

ctacaaccat ggagacctcc acaactgatg taggacaaaa tgtttctgct ttgaactcta 60 

.g cct ttt ggt cca gtg gga ttt atg aaa agt gcc ate tct ata get gag 109 
Pro Phe- Gly Pro Val Gly Phe Met Lys Ser Ala He Ser He Ala Glu 
1 5-10 15 

gat gaa gaa tgg aag aga tta ega tea ttg ctg tct cca acc ttc ace 157 
Asp Glu Glu Trp Lys Arg Leu Arg Ser Leu Leu Ser Pro Thr Phe Thr 
20 25 30 



agt gga aaa etc aag gag gtatgaaaat aacatgagtt ttaataagaa 
Ser Gly Lys Leu Lys Glu 
35 



205 



acttaaagaa tgaatctggt ggggaeaggt a 23 6 



<210> 133 
<211> 38 • 
<212> PRT 

<213> Homo sapiens 
<400> 133 



Pro Phe Gly Pro Val Gly Phe Met Lys Ser Ala He Ser He Ala Glu 
1 5 10 15 
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Asp Glu Glu Trp Lys Arg Leu Arg Ser Leu Leu Ser Pro Thr Phe Thr 
20 25 30 

Ser Gly Lys Leu Lys Glu 
35 



<210> 134 
<211> 393 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> intron 
<222> (1) . . (98) 

<220> 

<221> exon 

<222> (99) . . (247) 

<220> 

<221> intron 
<222> (248) . . (393) 

<220> 

<221> CDS 

<222> (100) . . (246) 



<400> 134. 

gtctgtcttg actggacatg tggctttcct gatgcacgca tagaggaagg atggtaaaaa 60 

ggtgctgatt ttaattttcc acatctttct ccactcagc gtc ttt ggg gcc tac 114 

Val Phe Gly Ala Tyr 
1 5 



age atg gat gtg ate act age aca tea ttt gga gtg aac ate gae tct 162 
Ser Met Asp Val lie Thr Ser Thr Ser Phe Gly Val Asn lie Asp Ser 
10 15 20 



etc aac aat cea caa gac ccc ttt gtg gaa aac acc aag aag ctt tta 210 
Leu Asn Asn Pro Gin Asp Pro Phe Val Glu Asn Thr Lys Lys Leu Leu 
25 30 35 



aga ttt gat ttt ttg gat cca ttc ttt etc tea ata agtatgtgga 256 
Arg Phe Asp Phe Leu Asp Pro Phe Phe Leu Ser lie 
40 45 

ctactattte cttttattta tcttkctetc ttaaaaataa ctgctttatt gagatataaa 316 

tcaccatgta attcatccac ttaaaatata cagttcagtg atttgtagta catttgaaga 376 

tatgtgtgac catcatq 393 



<210> 135 

<211> 49 

<212> PRT 

<213> Homo sapiens 



<400> 135 

Val Phe Gly Ala Tyr Ser Met Asp Val lie Thr Ser Thr Ser Phe Gly 



1 

Val Asn lie Asp 
20 

.Thr Lys Lys Leu 
35 

He 



5 

Ser Leu Asn Asn 

Leu Arg Phe Asp 
40 
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10 

Pro Gin Asp Pro 
25 

Phe Leu Asp Pro 



Phe Val Glu Asn 
30 

Phe Phe Leu Ser 
45 



<210> 136 

<211> 240 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> intron 

<222> (1) . . (82) 

<220> 

<221> axon 

<222> (83) . . (149) 

<220> 

<221> intron 
<222> (150) . . (240) 

<220> 

<221> CDS 

<222> (83) . . (148) 

<400> 136 

ggagatcaag gaccacgctt gtgatttact tctgacttca ggagccactt tctgtcagtg 60 

aaatttctct ttttgcttct ag cac cga gtg gat ttc ctt cag ctg atg att 112 

His Arg Val Asp Phe Leu Gin Leu Met He 
15 10 

gac tct cag aat tea aaa gaa act gag tec cac aaa ggtaaccaga 158 
Asp Ser Gin Asn Ser Lys Glu Thr Glu Ser His Lys 
15 20 

gtgtttctga gggctacttg tggggcactc agagggaagg ccttgttctg aaaatgtgca 218 

ggaagtattc caggatgatg ag 240 



<210> 137 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 137 . 

His Arg Val Asp Phe Leu Gin Leu Met He Asp Ser Gin Asn Ser Lys 
i 5 10 15 



Glu Thr Glu Ser His Lys 
20 
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<210> 138 
<211> 399 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> intron 
<222> (1) . . (Ill) 

<220> 

<221> exon 

<222> (112) . . (338) 

<220> 

<221> intron 
<222> (339) . . (399) 

<220> 

<221> CDS 

<222> (112) . . (336) 

<400> 138 

ccagtatgag ttgttctctg gaacttctaa cagttcaaca gtactacatg gactgagtta 60 

aaagttaatt caaaaatctc aatttatcca aatctgtttc tttcttttca g gca cca 117 

Ala Pro 
1 

ccc acc tat gat act gtg eta cag atg gag tat ctt gac atg gtg gtg 165 
Pro Thr Tyr Asp Thr Val Leu Gin Met Glu Tyr Leu Asp Met Val Val 
5 10 15 . 

aat gaa acg etc aga tta ttc cca att get atg aga ctt gag agg gtc 213 
Asn Glu Thr Leu Arg Leu Phe Pro He Ala Met Arg Leu Glu Arg Val 
20 25 30 

tgc aaa aaa gat gtt gag ate aat ggg atg ttc att ccc aaa ggg tgg 2 61 
Cys Lys Lys Asp Val Glu He Asn Gly Met Phe He Pro Lys Gly Trp 
35 40 45 50 

gtg gtg atg att cca age tat get ctt cac cgt gac cca aag tac tgg 309 
Val Val Met He Pro Ser Tyr Ala Leu His Arg Asp Pro Lys Tyr Trp 
55 60 65 

asa gag cet gag aag ttc etc cct gaa aggtaggagg cccctgggaa 356 
Xaa Glu Pro Glu Lys Phe Leu Pro Glu 
70 75 

gggagcecte ectgaaecag cctggttcaa geatattctg cct 399 

<210> 139 

<211> 75 

<212> PRT 

<213> Homo sapiens 

<400> 139 

Ala Pro Pro Thr Tyr Asp Thr Val Leu Gin Met Glu Tyr Leu Asp Met 
15 10 15 



Val Val Asn Glu Thr Leu Arg Leu Phe Pro He Ala Met Arg Leu Glu 
20 25 30 
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Arg Val Cys Lys Lys Asp Val Glu He Asn Gly Met Phe He Pro Lys 

35 40 45 

Gly Trp Val Val Met He Pro Ser Tyr Ala Leu His Arg Asp Pro Lys 

50 55 60 

Tyr Trp Xaa Glu Pro Glu Lys Phe Leu Pro Glu 

65 70 75 



<210> 140 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 140 

ccagtatgag ttgttctctg g 21 



<210> 141 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
<400> 141 

aggcagaata tgcttgaacc aggc 24 



<210> 142 
<211> 26 
<212> DNA 

■<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 142 

gaagtggacg tggaaccttc ctggac 26 

<210> 143 
<211> 304 
<212> DNA 
<213> Homo sapiens 

<400> 143 

agtctggctt cctgggttgg gctccagctg tagaataagg ctgttgatgt ttaatcaact 60 
ctgttttttt cacacagctt ttatgatggt caacagcctg tgctggctat cacagatcct 120 
gacatgatca aaacagtgct agtgaaagaa tgttattctg tcttcacaaa ccggagggta 180 
agcattcatg tgttgaaatt aaaatactga ttgattaaat ttatattttg aaattcttat 240 
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atattcatag acagttgcct aaaaaatgtc caggaaggtt ccacgtccac ttcatcctgt 300 
cccc 



<210> 144 
<211> 236 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (62) . . (175) 

<220> 

<221> intron 
<222> (1) . . (61) 

<220> 

<221> intron 
<222> (176) . . (236) 

<220> 

<221> exon 

<222> (62) . . (175) 



<400> 144 

ctacaaccat ggagacctcc acaactgatg taggacaaaa tgtttctgct ttgaactcta 60 

g cct ttt ggt cca gtg gga ttt atg aaa agt gcc ate tct ata get gag 109 
Pro Phe Gly Pro Val Gly Phe Met Lys Ser Ala lie Ser lie Ala Glu 
1 5 10 15 

gat gaa gaa tgg aag aga tta caa tea ttg ctg tct cca acc ttc ace 157 
Asp Glu Glu Trp Lys Arg Leu Gin Ser Leu Leu Ser Pro Thr Phe Thr 
20 25 30 

agt gga aaa etc aag gag gtatgaaaat aacatgagtt ttaataagaa 205 
Ser Gly Lys Leu Lys Glu 
35 

acttaaagaa tgaatctggt ggggacaggt a 23 6 



<210> 145 
<211> 38 
<212> PRT 

<213> Homo sapiens 
<400> 145 

Pro Phe Gly Pro Val Gly Phe Met 
1 5 

Asp Glu Glu Trp Lys Arg Leu Gin 
20 



Lys Ser Ala lie Ser lie Ala Glu 
10 15 

Ser Leu Leu Ser Pro. Thr Phe Thr 
25 30 



Ser Gly Lys Leu Lys Glu 
35 
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<210> 146 
<211> 379 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (188) . . (274) 

<400> 146 

ccctttccaa ggggtagtcc actgaatttg agctgcctaa aaatggtctt ttatctttat 60 

gtacagaaaa cacatcacaa aattcattat aaaatgtcac ttactgctcc atgctgggga 120 

aagceatgtc cttctgggac tagagtctgc acatttaact atgggtggtg ttgtgttttg 180 

tgcttag atg gtc cct ate att gcc cag tat, gga gat gtg ttg gtg aga 229 
Met Val Pro lie lie Ala Gin Tyr Gly Asp Val Leu Val Arg 
1 5 -10 

aat ctg agg egg gaa gca gag aca ggc aag cct ate ace ttg aaa 274 
Asn Leu Arg Arg Glu Ala Glu Thr Gly Lys Pro lie Thr Leu Lys 
15 20 25 

gagtaagtag aagcgcagcc atggggttct gagctgtcat gaacccctcc agctgcctgc 334 

catggagctg atattcctgc tgttgggtta ttccagtgac cagac 379 



<210> 147 
<211> 29 
<212> PRT 

<213> Homo sapiens 
<400> 147 

Met Val Pro lie lie Ala Gin Tyr 
1 5 

Arg Arg Glu Ala Glu Thr Gly Lys 
20 



Gly Asp Val Leu Val Arg Asn Leu 
10 15 

Pro lie Thr .Leu Lys 
25 



<210> 148 
<211> 379 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS - 

<222> (188) , . (274) 

<400> 148 

ccctttccaa ggggtagtcc actgaatttg agctgcctaa aaatggtctt ttatctttat 60 
gtacagaaaa cacatcacaa aattcattat aaaatgtcac ttactgctcc atgctgggga 120 
aagceatgtc cttctgggac tagagtctgc acatttaact atgggtggtg ttgtgttttg 180 



tgcttag atg gtc cct ate att gcc cag tat gga gat gtg ttg gtg aga 229 
Met Val Pro lie lie Ala Gin Tyr Gly Asp Val Leu Val Arg 
15 10 
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aat ctg agg egg gaa gca gag aca ggc aag cct gtc acc ttg aaa 274 
Asn Leu Arg Arg Glu Ala Glu Thr Gly Lys Pro Val Thr Leu Lys 
15 20 25 

cagtaagtag aagcgcagcc atggggttct gagctgtcat gaacccctcc agctgcctgc 334 

catggagctg atattcctgc tgttgggtta ttccagtgac cagac 379 



<210> 149 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 149 

Met Val Pro lie lie Ala Gin Tyr Gly Asp Val Leu Val Arg Asn Leu 
1 5 10 15 

Arg Arg Glu Ala Glu Thr Gly Lys Pro Val Thr Leu Lys 
20 25 



<210> 150 
<211> 379 
<212> DNA 
<213> Homo sapiens 

<400> 150 

ccctttccaa ggggtagtcc actgaatttg 
gtacagaaaa cacatcacaa aattcattat 
aagccatgtc cttctgggac tagagtctgc 
tgcttagatg gtccctatca ttgcccagta 
ggaagcagag acaggcaagc ctgtcacctt 
ttctgagctg tcatgaaccc ctccagctgc 
gttattccag tgaccagac 



agctgcctaa aaatggtctt ttatctttat 60 
aaaatgtcac ttactgctcc atgctgggga 12 0 
acatttaact atgggtggtg ttgtgttttg 180 
tggagatgtg ttggtgagaa atctgaggcg 240 
gaaagagtaa gtagaagcgc agctatgggg 300 
ctgccatgga gctgatattc ctgctgttgg 360 

379 



<210> 151 
<211> 379 
•<212> DNA 
<213> Homo sapiens 

<400> 151 

ccctttccaa ggggtagtcc actgaatttg 

gtacagaaaa cacatcacaa aattcattat 

aagccatgtc cttctgggac tagagtctgc 

tgcttagatg gtccctatca ttgcccagta 

ggaagcagag acaggcaagc ctgtcacctt 

ttctgagctg tcatgaaccc ctccagctgc 
gttattccag tgaccagac 



agctgcctaa aaatggtctt ttatctttat 60 
aaaatgtcac ttactgctcc atgctgggga 120 

acatttaact atgggtggtg ttgtgttttg 180 
tggagatgtg ttggtgagaa atctgaggcg 240 
gaaagagtaa gtagaagcgc agccatgggt 3 00 
ctgccatgga gctgatattc ctgctgttgg 360 

379 



<210> 152 
<211> 379 
<212> DNA 
<213> Homo sapiens 

<400> 152 

ccctttccaa ggggtagtcc actgaatttg 
gtacagaaaa cacatcacaa aattcattat 
aagccatgtc cttctgggac tagagtctgc 
tgcttagatg gtccctatca ttgcccagta 



agctgcctaa aaatggtctt ttatctttat 60 
aaaatgtcac ttactgctcc atgctgggga 120 
acatttaact atgggtggtg ttgtgttttg 180 
tggagatgtg ttggtgagaa atctgaggcg 240 
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ggaagcagag acaggcaagc ctgtcacctt gaaagagtaa gtagaagcgc agccatgggg 300 
ttctgagctg tcatgaaccc ctccagcggc ctgccatgga gctgatattc ctgctgttgg 3 60 
gttattccag tgaccagac 379 



<210> 153 
<211> 431 
<212> DNA 
<213> Homo sapiens 



<400> 153 

cccagtgtac 

tagatttctc 

ttcctctcct 

ttgctggcta 

accc tgatgt 

gtggatgata 

acttcccagg 

ggaaggctct 



ctctgaattg 
ttcatctaaa 
ttcagctctg 
tgaaaccacg 
ccagcagaaa 
catggagaag 
agaattttta 

g 



cttttctatt 
ctgtgatgcc 
tccgatctgg 
agcagtgttc 
ctgcaggagg 
gagggaggag 
taaaaagcat 



cttttccctt 
ctacattgat 
agctcgtggc 

tctccttcat 
aaattgatgc 
gtgaaacctt 
aatcactgat 



agggatttga 
ctgatttacc 
ccaatcaatt 

tatgtatgaa 
agttttaccc 
agcaaaaatg 
tctttcactg 



gggcttcact 60 
taaaatgtct 120 
atctttattt 180 

ctggccactc 240 
aataaggtga 3 00 
cctcctcacc 360 
actctatgta 420 
431 



<210> 154 
<211> 574 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (110) . . (334) 

<400> 154 

cagtatgagt tagtctctgg agctcctaat acttcattag tactgcatgg actgagttaa 60 

aagttaattc aaaatctcaa tttatccaaa tctgtttcgt tctttccag gca cca ccc 118 

Ala Pro Pro 
1 

acc tat gat act gtg eta cag atg gag tat ctt gac atg gtg gtg aat 166 
Thr Tyr Asp Thr Val Leu Gin Met Glu Tyr Leu Asp Met Val Val Asn 
5 10 . 15 

gaa atg etc aga tta ttc cca att get atg aga ctt gag agg gtc tgc 214 
Glu Met Leu Arg Leu Phe Pro lie Ala Met Arg Leu Glu Arg Val Cys 
20 25 30 35 

aaa aaa gat gtt gag ate aat ggg atg ttc att cce aaa ggg gtg gtg 262 
Lys Lys Asp Val Glu lie Asn Gly Met Phe lie Pro Lys Gly Val Val 
40 45 50 

gtg atg att cca age tat get ctt cac cgt gac cca aag tac tgg aea 310 
Val Met lie Pro Ser Tyr Ala Leu His Arg Asp Pro Lys Tyr Trp Thr 
55 -60 65 

gag cet gag aag ttc etc cct gaa aggtaeaagg cccctgggaa gggagccctc 364 
Glu Pro Glu Lys Phe Leu Pro Glu 
70 75 

cctgaaccag cctggttcaa gcatattctg cctctettaa tctacaggac agtcatgtgg 424 

ttgtataatt atttgcttgt atttttatat ttagagattt ttttaatcat caaattgatt 484 



attgtcacac tttacaaacc atagactaga aaaaagaaaa ctacagtcat ccacaattcc 544 
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aacaacttac gatgaaggtc atcagttatg 



574 



<210> 155 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<400> 155 

Ala Pro Pro Thr Tyr Asp Thr Val 
1 5 

Val Val Asn Glu Met Leu Arg Leu 
20 

Arg Val Cys Lys Lys Asp Val Glu 
35 40 

Gly Val Val Val Met lie Pro Ser 
50 55 

Tyr Trp Thr Glu Pro Glu Lys Phe 
65 70 



Leu Gin Met Glu Tyr Leu Asp Met 
10 15 

Phe Pro lie Ala Met Arg Leu Glu 
25 - 30 

lie Asn Gly Met Phe lie Pro Lys 
45 

Tyr Ala Leu His Arg Asp Pro Lys 
60 

Leu Pro Glu 
75 



<210> 156 
<211> 574 
<212> DNA 
<213> Homo sapiens 



<400> 156 

cagtatgagt tagtctctgg agctcctaat acttcattag tactgcatgg actgagttaa 60 
aagttaattc aaaatctcaa tttatccaaa tctgtttcgt tctttccagg caccacccac 120 
ctatgatact gtgctacaga tggagtatct tgacatggtg gtgaatgaaa cactcagatt 180 
attcccaatt gctatgagac ttgagagggt ctgcaaaaaa gatgttgaga tcaatgggat 240 
gttcattccc aaaggggtgg tggtgatgat tccaagctat gctcttcacc gtgacccaaa 3 00 
gtactggaca gagcctgaga agttcctccc tgaaaggtac aaggcccctg ggaagggagc 3 60 
cctccctgaa ccagcctggt tcaagcatat tctgcctctc ttaatctaca ggacagtcat 420 
gtggttgtat aattatttgc ttgtattttt atatttagag atttttttaa tcatcaaatt 480 
gattattgtc acactttaca aaccatagac tagaaaaaag aaaactacag tcatccacaa 540 
ttccaacaac ttacgatgaa ggtcatcagt tatg 574 



<210> 157 
<211> 574 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (110) . . (334) 

<400> 157 

cagtatgagt tagtctctgg agctcctaat acttcattag tactgcatgg actgagttaa 60 

aagttaattc aaaatctcaa tttatccaaa tctgtttcgt tctttccag gca cca ccc 118 

Ala Pro Pro 
1 



acc tat gat act gtg eta cag atg gag tat ctt gac atg gtg gtg aat 166 
Thr Tyr Asp Thr Val Leu Gin Met Glu Tyr Leu Asp Met Val Val Asn 
5 10 15 



85 



gaa acg etc aga tta ttc cca att get atg aga- ttt gag agg gtc tgc 214 
Glu Thr Leu Arg Leu Phe Pro lie Ala Met Arg Phe Glu Arg Val Cys 
20 25 30 35 

aaa aaa gat gtt gag ate aat ggg atg ttc att ccc aaa ggg gtg gtg 2 62 
Lys Lys Asp Val Glu lie Asn Gly Met Phe He Pro Lys Gly Val Val 
40 45 50 

gtg atg att cca age tat get ctt cac cgt gae cca aag tac tgg aea 310 
Val Met He Pro Ser Tyr Ala Leu His Arg Asp Pro Lys Tyr Trp Thr 
55 60 65 

gag cct gag aag ttc etc cct gaa aggtacaagg ceectgggaa gggagecetc 364 
Glu Pro Glu Lys Phe Leu Pro Glu 
70 75 

cctgaaccag cctggttcaa gcatattctg cctetettaa tetacaggac agtcatgtgg 424 

ttgtataatt atttgcttgt atttttatat ttagagattt ttttaatcat eaaattgatt 484 

attgtcacae tttacaaaee atagactaga aaaaagaaaa ctacagteat ccacaattcc 544 

aacaaettac gatgaaggte ateagttatg 574 

<210> 158 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<400> 158 

Ala Pro Pro Thr Tyr Asp Thr Val Leu Gin Met Glu Tyr Leu Asp Met 
1.5 10 15 

Val Val Asn Glu Thr Leu Arg Leu Phe Pro He Ala Met Arg Phe Glu 
20 25 30 

Arg Val Cys Lys Lys Asp Val Glu He Asn Gly Met Phe He Pro Lys 
35 40 45 

Gly Val Val Val Met He Pro Ser Tyr Ala Leu His Arg Asp Pro Lys 
50 55 60 

Tyr Trp Thr Glu Pro Glu Lys Phe Leu Pro Glu 
65 70 75 



<210> 159 
<211> 574 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (110) . . (334) 

<400> 159 

cagtatgagt tagtctctgg agctcetaat acttcattag taetgcatgg actgagttaa 60 

aagttaattc aaaatetcaa tttatccaaa tctgtttegt tctttceag gea cca ccc 118 

Ala Pro Pro 



86 



1 



acc tat gat act gtg eta cag atg gag tat ctt gac atg gtg gtg aat 166 
Thr Tyr Asp Thr Val Leu Gin Met Glu Tyr Leu Asp Met Val Val Asn 
5 10 . ' 15 

gaa acg etc aga tta ttc cca att get atg aga ctt gag agg gtc tgc 214 
Glu Thr Leu Arg Leu Phe Pro lie Ala Met Arg Leu Glu Arg Val Cys 
20 25 30 35 

aaa aaa gat gtt gag ate aat ggg atg ttc att ccc aaa ggg gtg gtg 2 62 
Lys Lys Asp Val Glu lie Asn Gly Met Phe lie Pro Lys Gly Val Val 
40 45 50 

gtg atg att cca age tat get ctt cac cgt gac cca aag tac tgg aea 310 
Val Met lie Pro Ser Tyr Ala Leu His Arg Asp Pro Lys Tyr Trp Thr 
55 60 65 

gag cct gag aag ttc etc ctt gaa aggtacaagg eccctgggaa gggagceetc 3 64 
Glu Pro Glu Lys Phe Leu Leu Glu 
70 75 

ectgaaccag cctggttcaa geatattetg ectctcttaa tctaeaggac agteatgtgg 424- 

ttgtataatt atttgcttgt atttttatat ttagagattt ttttaatcat caaattgatt 484 

attgtcacac tttacaaacc atagactaga aaaaagaaaa ctaeagteat ceaeaattcc 544 

aacaacttac gatgaaggtc atcagttatg 574 



<210> 160 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<400> 160 

Ala Pro Pro Thr Tyr Asp Thr Val Leu Gin Met Glu Tyr Leu Asp Met 
15 10 15 

-Val Val Asn Glu Thr Leu Arg Leu Phe Pro lie Ala Met Arg Leu Glu 
20 25 30 

Arg Val Cys Lys Lys Asp Val Glu lie Asn Gly Met Phe lie Pro Lys 

35 40 45 . . 

Gly Val Val Val Met lie Pro Ser Tyr Ala Leu His Arg Asp Pro Lys 
50 55 60 

Tyr Trp Thr Glu Pro Glu Lys Phe Leu Leu Glu 
65 70 75 



<210> 161 
<211> 574 
<212> DNA 

<213> Homo sapiens 
<400> 161 

cagtatgagt tagtetctgg agetcetaat actteattag tactgeatgg actgagttaa 60 
aagttaattc aaaatctcaa tttatecaaa tetgtttcgt tctttecagg eaccaceeae 120 
ctatgatact gtgctacaga tggagtatct tgacatggtg gtgaatgaaa cgctcagatt 180 



attcccaatt gctatgagac ttgagagggt 
gttcattccc aaaggggtgg tggtgatgat 
gtactggaca gagcctgaga agttcctccc 
cctccctgaa ccagcctggt tcaagcatat 
gtggttgtat aattatttgc ttgtattttt 
gattattgtc acactttaca aaccatagac 
ttccaacaac ttacgatgaa ggtcatcagt 
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ctgcaaaaaa gatgttgaga tcaatgggat 240 
tccaagctat gctcttcacc gtgacccaaa 3 00 
tgaaaggtac aaggtccctg ggaagggagc 3 60 
tctgcctctc ttaatctaca ggacagtcat 420 
atatttagag atttttttaa tcatcaaatt 480 
tagaaaaaag aaaactacag tcatccacaa 540 
tatg 574 



<210> 162 
<211> 411 
<212> DNA 

<213> Homo sapiens 



<400> 162 

cctgtgtact actagttgag gggtggcccc 
ggggtattat gtcattaact ttttaaaaat 
gaacaaggac aacatagatc cttacatata 
cattggcatg aggtttgctc tcatgaacat 
cttctccttc aaaccttgta aagaaacaca 
gtattaataa ttcttttaac tgagtggtct 
tcttttacta atttgttctc tgggccaaag 



taagtaagaa accctaacat gtaactctta 60 
ctaccaatgt ggaaccagat tcagcaagaa 120 
cacacccttt ggaagtggac ccagaaactg 180 
gaaacttgct ctaatcagag tccttcagaa 240 
ggttagtcaa ttttctataa aaataatgtt 3 00 
gtatttttta aaaagaatat gcttgtttaa 3 60 
aatcaattag gcccatctgt g 411 



<210> 163 
<211> 288 
<212> DNA 

<213> Homo sapiens 



<400> 163 

ggagtgtctc actcactttg atgctatact 
tatgcttgtt taactgttgc agatccccct 
agaaaaaccc gttgttctaa aggttgagtc 
tttcctaagg acttctgctt tgctcttcaa 
caaattactt tgtgaataga actctgaaat 



ttctactttt gtttatttaa tgcttctcaa 60" 
gaaattaagc ttaggaggac ttcttcaacc 120 
aagggatggc actgtaagtg gagcctgaat 180 
gaaatctgtg cctgagaaca ccagagacct 24 0 
gaagatgggc ttcatcca 288 



<210> 164 

<211> 288 
<212> DNA 

<213> Homo sapiens 



<400> 164 

ggagtgtctc actcactttg atgctatact 
tatgcttgtt taactgttgc agatccccct 
agaaaaaccc gttgttctaa aggttgagtc 
tttcctaagg acttcggctt tgctcttcaa 
caaattactt tgtgaataga actctgaaat 



ttctactttt gtttatttaa tgcttctcaa 60 
gaaattaagc ttaggaggac ttcttcaacc 12 0 
aagggatggc accgtaagtg gagcctgaat 180 
gaaatctgtg cctgagaaca ccagagacct 240 
gaagatgggc ttcatcca 288 



<210> 165 
<211> 236 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> r=g or .a 
<400> 165 

ctacaaccat ggagacctcc acaactgatg 
gccttttggt ccagtgggat ttatgaaaag 
gaagagatta cratcattgc tgtctccaac 
aaaataacat gagttttaat aagaaactta 



taggacaaaa tgtttctgct ttgaactcta 60 
tgccatctct atagctgagg atgaagaatg 120 
cttcaccagt ggaaaactca aggaggtatg 180 
aagaatgaat ctggtgggga caggta 236 
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<210> 166 
<211> 379 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> r=:g or a, y=t or c, s=g or c, k=g or t 



<400> 166 
ccctttccaa 
gtacagaaaa 
aagccatgtc 

tgcttagatg 
ggaagcagag 
ttctgagctg 
gttattccag 



ggggtagtcc 
cacatcacaa 
cttctgggac 
gtccctatca 
acaggcaagc 
tcatgaaccc 
tgaccagac 



actgaatttg 
aattcattat 
tagagtctgc 
ttgcccagta 
ctrtcacctt 
ctccagckgc 



agctgcctaa 
aaaatgtcac 
acatttaact 
tggagatgtg 
gaaasagtaa 
ctgccatgga. 



aaatggtctt 
ttactgctcc 
atgggtggtg 
ttggtgagaa 
gtagaagcgc 
gctgatattc 



ttatctttat 60 
atgctgggga 120 
ttgtgttttg 180 
atctgaggcg 240 
agcyatgggk 300 
ctgctgttgg 360 
379 



<210> 167 
<211> 431 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> r=g or a 



<400> 167 

cccagtgtac 

tagatttctc 

ttcctctcct 

ttgctggcta 

accctgatgt 

gtggatgrta 

acttcccagg 

ggaaggctct 



ctctgaattg 
ttcatctaaa 

ttcagctctg 
tgaaaccacg 
ccagcagaaa 
catggagaag 
agaattttta 
g 



cttttctatt 
ctgtgatgcc 
tccgatctgg 
agcagtgttc 
ctgcaggagg 
gagggaggag 
taaaaagcat 



cttttccctt 
ctacattgat 

agctcgtggc 
tctccttcat 
aaattgatgc 
gtgaaacctt 
aatcactgat 



agggatttga 
ctgatttacc 
ccaatcaatt 
tatgtatgaa 
agttttaccc 
agcaaaaatg 
tctttcactg 



gggcttcact 
taaaatgtct 
atctttattt 
ctggccactc 
aataaggtga 
cctcctcacc 
actctatgta 



60 

120 

180 

240 

300 

360 

420 

431 



<210> 168 
<211> 574 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> y=t or c, r=g or a 
<400> 168 

cagtatgagt tagtctctgg agctcctaat 
aagttaattc aaaatctcaa tttatccaaa 
ctatgatact gtgctacaga tggagtatct 
attcccaatt gctatgagay ttgagagggt 
gttcattccc aaaggggtgg tggtgatgat 
gtactggaca gagcctgaga agttcctccy 
cctccctgaa ccagcctggt tcaagcatat 
gtggttgtat aattatttgc ttgtattttt 
gattattgtc acactttaca aaccatagac 
ttccaacaac ttacgatgaa ggtcatcagt 



acttcattag tactgcatgg actgagttaa 60 
tctgtttcgt tctttccagg caccacccac 120 
tgacatggtg gtgaatgaaa yrctcagatt 180 
ctgcaaaaaa gatgttgaga tcaatgggat 240 
tccaagctat gctcttcacc gtgacccaaa 3 00 
tgaaaggtac aaggyccctg ggaagggagc 3 60 
tctgcctctc ttaatctaca ggacagtcat 420 
atatttagag atttttttaa tcatcaaatt 480 
tagaaaaaag aaaactacag tcatccacaa 540 
tatg 574 



<210> 169 
<211> 411 
<212> DNA 



<213> Homo sapiens 
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<220> 

<223> y=:t or c 



<400> 169 

cctgtgtact 

ggggtattat 

gaacaaggac 

cattggcatg 

cttctccttc 

gtattaataa 

tcttttacta 



actagttgag 
gtcattaact 
aacatagatc 
aggtttgctc 
aaaccttgta 
ttcttttaac 
atttgttctc 



gggtggcccc 
ttttaaaaat 
cttacatata 

tcatgaacat 
aagaaacaca 
tgagtggtct 
tgggccaaag 



taagtaagaa 
ctaccaaygt 
cacacccttt 

gaaacttgct 
ggt tagtcaa 
gtatttttta 
aatcaattag 



accctaacat 
ggaaccagat 
ggaagtggac 

ctaatcagag 
ttttctataa 
aaaagaatat 
gcccatctgt 



gtaactctta 60 
tcagcaagaa 120 
ccagaaactg 180 

tccttcagaa 240 
aaataatgtt 300 
gcttgtttaa 360 
g 411 



<210> 170 

<211> 288 
<212> DNA 

<213> Homo sapiens 
<220> . 

<223> y=t or c, k=g or t 

<400> 170 

ggagtgtctc actcactttg atgctatact ttctactttt gtttatttaa tgcttctcaa 60 
tatgcttgtt taactgttgc agatccccct gaaattaagc ttaggaggac ttcttcaacc 120 
agaaaaaccc gttgttctaa aggttgagtc aagggatggc acygtaagtg gagcctgaat 180 
tttcctaagg acttckgctt tgctcttcaa gaaatctgtg cctgagaaca ccagagacct 240 
caaattactt tgtgaataga actctgaaat gaagatgggc ttcatcca 288 



<210> 


171 




<211> 


30 




<212> 


DNA 




<213> 


Homo 


sapiens 


<220> 






<221> 


CDS 




<222> 


(1) . . 


. (30) 


<400> 


171 





cct gtc acc ttg aaa cac gtc ttt ggg gcc 3 0 

Pro Val Thr Leu Lys His Val Phe Gly Ala 
1 5 10 



<210> 172 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 172 . 

Pro Val Thr Leu Lys His Val Phe Gly Ala 
15 10 



